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EXECUTIVE SUmiARX 


As Geobase Information Systems increase in number* size and complexity, the 
format compatability of satellite remote sensing data becomes increasingly more 
important. Because of the vast and continually increasing quantity of data avail- 
able from remote sensing systems the utility of these data is increasingly de- 
pendent on the degree to which their formats facilitate, or hinder, their incor- 
poration into Geobase Information Systems. To merge satellite data into a geo- 
base system requires that they both have a compatible geographic referencing 
system. The cost of converting satellite data to a compatible format may be too 
large for many users. Greater acceptance of satellite data by the user com- 
munity will be facilitated if the data are in a form which most readily cor- 
responds to existing geobase data structures. 

While most users agree that a common format is desirable the actual struc- 
ture of that format is the subject of much debate. However, the following gen- 
eral points appear to represent a consensus of the user community: 


* A procedure for windowing and reformatting Landsat data so that it 
could directly overlay USGS map quadrangles would increase the utility 
of remote sensing data in geobase information systems. If this pro- 
cedure is initiated it will result in the attraction of new users, 
reduced handling and processing time for all users, and an overall cost 
savings. 

* New geographic data bases designed to provide the public with Information 
and serve the needs of federal and State agencies should be structured to 
offer maximum flexibility for conversion by external users. Both polygon 
and gridded approaches to a data base structure must continue to be eval- 
uated. 'Both types may need to be supported by providers of image pro- 
ducts. 

* A workshop bringing together representatives from local, county, regional, 
state and national levels of government is recommended. The workshop 
should examine both the recommendations given in this report, and the 
development of upwardly compatible geobase information systems between 
hierarchies of government. 

* At the state level a central agency should examine a common format phi- 
losophy in respect to its impact on the geographic information needs of 
the state. A formal procedure should be developed to support implemen- 
tation of a common format at all agency levels. 

The conference addressed a number of specific topics and made the following 
recommendations: 

GENERAL FORMATS FOR FUTURE DIGITAL IMAGE PRODUCTS 

Two distinct families of imagery should be available to users as standard 
products: 



* Uncorrected video data, either in completely unaltered, "raw" form, 
or radiometrlcally calibrated but not destriped, but with substantial 
header and annotation Information, 

* Fully processed data, which has been radiometrically corrected, In- 
cluding destrlping, and geometrically rectified to a UTM projection. 

Also; 

* Processed data should be available in certain other projections and 
formats as "'•pedal -order" products, for which a somewhat longer order 
turnaround would be acceptable. 

ANNOTATION 

The best and most complete annotation should be made available to users of 
either raw or reformatted satellite data. All forms of annotation currently 
provided should continue to be provided. In addition, it is recommended that 
the following information be supplied with all digital image data: 

* Type of projection, if applicable. 

* Sampling Interval in meters on the ground. 

* Up-to-date algorithms and coefficients for converting digital radiance 
numbers to actual space upwelllng radiances. 

* Several ground control points Identified in image and standard geo- 
reference coordinates. 

* More accurate geodetic grid ticks and nadir/ format center coordinates 
(preferably designated in latitude and longitude). 

* More accurate spacecraft positional (altitude and attitude) information. 

* More spatially specific cloud coverage estimates. 

RECTIFICATION 

The rectified imagery should meet the following specifications: 

* The map projection employed should be conformal and the pixels should 
be aligned in an easting direction, amenable to Cartesian referencing. 
Universal Transverse Mercator was the projection preferred by users 
attending the conference. 

* Rectified imagery should be indexed to existing USGS 7-1/2 minute 
quadrangle topographic maps, whereon sample locations should be fixed. 

* Resampling should be done only once and should either preserve the nom- 
inal spatial resolution of the sensor, or, if confirmed by further 
study, transform to one of a series of standard resolutions, e.g. bi- 
nary (16 , 32 , 64, 128 ... meters), or a sequence preferred by some 
users of 10, 25, 50, 100, 250 ... meters (10/25 series). 

* Projections other than the recommended UTM standard should be available 
upon request, but the transformations should be done from the raw data 
to avoid resampling data more than once. 
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In terms of assessing the effect of Image rectification on Image data accuracy 
the following Is recommended : 

* Further Investigations should be undertaken to determine the effects 

of various resampling algorithms, including double resampling, on radio- 
metric and spatial fidelity, and classifier performance. 

* The rectification algorithms used should be provided with the data to 
the user community and algorithms for forward and Inverse transforms 
between a variety of standard projections should be available for dis- 
tribution to users. 

SPATIAL RESOLUTION OF THE STANDARD PRODUCTS 

* "Raw" digital data should preserve the nominal spatial resolution of 
the imaging sensor. 

* The "corrected" digital data should be resampled at an interval selected 
from the binary series (i.e., 2 n meters) or the "10/25" series. For 
Landsat-D, a suggested resampling interval would be 25 or 32 meters. 

* Data compression techniques which irreversibly degrade spatial resolu- 
tion should not be employed. 

* Precise evaluations of the effects on spatial resolution of any resam- 
pling algorithms employed, should be made available, either by broader 
dissemination of existing studies or the initiation of new ones. 

RADIOMETRIC RESOLUTION 

* NASA should continue to improve their calibration procedures, and make 
all algorithms and constants available to the users through EDC {EROS 
Data Center). 

* Data in which striping is present must be available to the users in 
uncorrected form. 

* Pathwise calibration on an entire strip should be employed, to minimize 
scene-to-scene and along-track wlthin-scene variations. 

* Future video data should be represented as 8-bit, unsigned binary 
integers in the range 0-255. 

* Further investigation (and/or broader dissemination of existing data) 
is needed regarding the effects of resampling on radiometric fidelity, 
and on the feasibility of applying atmospheric corrections at the 
system level. 

SPECTRAL RESOLUTION OF SPACE SENSORS 

* Future operational sensor systems should (to the degree proven necessary) 
provide spectral continuity with multi spectral bands already operational 
in the user community. 

TEMPORAL RESOLUTION 

* NASA should provide for a more frequent overpass Interval than the cur- 
rent 18 days, either by utilizing different orbital parameters or by 
the recent practice of staggering the orbits of 2 or more Identical 
satellites. 



* 

Multiple coverage of path overlap areas should be retained. 

* Turnaround time on orders of Image or digital products should be dras- 
tically reduced, to no more than 7 days from satellite overpass, with 
a preferred 2-day turnaround for tlme-crl .leal uses, 

* Data archiving should permit preservation of good Imagery of areas where 
cloud cover Is a severe problem. 

CCT FORMATS AND DATA STORAGE 

* Three logical structures, BIP (band Interleaved by pixel), 6IL (band 
Interleaved by line) and BSQ (band sequential) should be available. The 
current structure, BIP2 (band Interleaved by pixel pairs) Is considered 
the least useful format. 

* The logical record structure of the Image file should Include substan- 
tial annotation for each record. 

* Physical blocking of data on a tape should allow for I/O buffer limita- 
tions of minicomputers. 

* Any data compression techniques employed should be reversible to the 
level of sensor noise. 

* An appropriate Federal agency, probably EDC, should distribute FORTRAN 
I/O subroutines for performing common manipulations of Imagery in what- 
ever formats are ultimately supplied. 

* 1600 bpi CCT's should continue to be the principal mode of data exchange 
until higher-density storage devices become more wide-spread. 

DETERMINATION OF USER DEMAND PRICING SCHEDULES 
AND ABSORPTION OF PREPROCESSING COSTS 

* Better estimates of the needs of the users of satellite data are re- 
quired. The needs of State and local governments for several levels 
of processing of the data, and for various software Implementations, 
seem poorly understood. NASA, or some other Federal agency, should 
undertake to provide more detailed estimates of these needs. 

* Both the demand and the pricing schedules should be jointly determined 
by analytical techniques such as Demand Revealing Processes. These 
determinations should be made for various product combinations and 
should be performed on a regularly scheduled basis. 

* A portion of the preprocessing costs should be absorbed by the Federal 
government, while the variable costs should be recovered from the aggre- 
gation of users. This federal Investment will maximize the benefits 
obtained from the use of satellite data byovercoming the threshold nec- 
essary to Initiate the diffusion of a technological Innovation. In 
economic terms the federal subsidy should also be viewed as an invest- 
ment aimed at maximizing the present value of the stream of net benefits 
obtained from the use of satellite data. 

* NASA, the EROS Data Center of the U.S. Department of the Interior, and 
such other federal agencies as should properly be involved, examine the 
division of responsibilities and costs between them, in order to imple- 
ment the preceding recommendations. 
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SOFTWARE PORTABILITY 


* Where possible programs for general distribution should be written 
in FORTRAN. Departures from the ANSI X3.9 1966 standard should be 
isolated in subroutines and clearly documented. Especially, pro- 
grams should Isolate all Input/output and bit manipulation portions. 
Implementation of this recommendation requires that programmers read 
and become familiar with the Standard. A new “Draft proposed ANS 
FORTRAN" is in circulation, but it will probably be several years 
before compilers meeting it will be widely available. 

* It Is recommended that NASA and EDC develop Image Processing and 
Analysis Software Standards to ensure that future software development 
is In as Internal ly-cons Is tent a form as possible, and to ensure that 
portability Is enhanced. 

LANDSAT-D ' 

* Data from the Thematic Mapper (TM) should be available in two formats: 

- uncorrected (radiometrically calibrated), and with extensive annota- 
tion 

- radiometrically corrected and geometrically rectified to a standard 
(UTM) map projection, scan line orientation, and pixel size 

* Since TM data will require some form of resampling, especially to 
coarser spatial resolutions employed by existing data bases, NASA should 
encourage the development and exchange of information on the effects of 
resampling. 

* Data compression for the TM is probably unavoidable, but should employ 
methods which are reversible at least to the level of rated sensor noise. 

* The logical structure of distributed TM data should be band-interleaved- 
by-llne (BIL), with optional availability of band-sequential (BSQ), and 
band-interleaved-by pixel (BIP). 

* NASA should be aware of technological developments which when implemented 
by the user community, might affect the utilization of TM data such as 
Increasing availability of high-density storage devices and array 
processors. 

It was the conclusion of the conference that NASA, EROS Data Center, and 
other pertinent federal agencies, examine all of the preceding recommendations 
to determine their feasibility and implementation costs. This should be done 
to determine the benefits to the entire system of users, the division of respon- 
sibilities and costs between the agencies involved, and the contributions to be 
expected from users. 

The following pages present background material on the problems of inte- 
grating remote sensing data into geobase information systems, as well as a brief 
discussion of existing remote sensing data and digital processing techniques. 
Following these are detailed discussions of each of the recommendations summar- 
ized above. 



INTRODUCTION: A WORKSHOP TO EXAMINE GEOBASE INFORMATION SYSTEM 
IMPACTS ON SPACE IMAGE FORMATS 


PURPOSE OF THE WORKSHOP 

A NASA-sponsored workshop to examine Geobase Information Systems Impacts on 
Space Image Formats was convened at LaCasa de Marla* Santa Barbara, on September Il- 
ls, 1977. This workshop brought together some fifty NASA, other federal agency 
state, university, and private sector users and developers of Geobase Information 
Systems. These participants were asked to examine the potential Impact of Geobase 
Information System formats and procedures on space Image formats for Landsat-D and 
other future satellite systems. 

The rationale behind the workshop was that Geobase Information Systems, would 
soon be so pervasive— If they were not already so — as to markedly Influence, or 
Indeed potentially dictate, the formats of computer compatible tapes and other data 
products from space imaging systems. A thesis leading to the workshop was that If 
this Indeed were the case, the acceptance of space remote sensing Imaging from 
Landsat-D, and Imaging radars in Space Shuttle, and other systems would be markedly 
conditioned by the degree to which the Computer Compatible Tape (CCT) format and 
other formats from these Imaging systems facilitated, or hindered, their incorpra- 
tion Into Geobase Information Systems. 

An underlying finding of the workshop was that the development of integrated 
resource monitoring systems characteristically required the parallel development of 
several technologies in addition to that of remote sensing. Acceptance of remote 
sensing by the user community thus may hinge on model development, low cost digital 
analysis, or some other parallel technology or conceptual development, or as in the 
case of this workshop upon the development of Geobase Information Systems, and their 
format compatibilities with space Image products. 

The Reformatting Problem 

The front-end cost of adjusting Landsat 1 and 2 to a variety of formats when 
considered as a national aggregate Is substantial. The possibility of endless re- 
formatting of future space imaging CCT's--as a result of following the earlier 
Landsat 1 and 2 designs which are not readily compatible with user Geobase Systems-- 
looms as a major burden on NASA space imaging systems. A number of authors have 
argued over the last several years that state and local agency remote sensing (amongst 
others) will not really mature until it is fully integrated into Geobase Information 
Systems and has been used routinely for a number of years (Simonett, 1976). In the 


call for attendance at the workshop It was stressed that If these arguments are 
correct. It is Important for NASA to examine these interface, compatibility . 
repetitive front-end and future re-formatting problems to find if cost effective 
satellite image formats can be provided fay NASA and/or federal agencies, which 
facilitate or even advance the use of both satellite and Geobase data . 

The members of the workshop were divided into five panels. Each panel ad- 
dressed the same topics and concerns. At the end of the workshop the Chairmen of 
the respective panels met with the organizers and Mr. Frederic Billingsley of NASA 
headqiarters to review the several panel reports and the discussion at the general 
meeting at the end of the conference, and to coordinate the recommendations. Much 
of the proceedings of individual panels were tape recorded and transcribed, in addi- 
tion to the documentation given in the individual panel reports. Using these mat- 
erials and further items prepared by the organizers, this report has been prepared. 
It documents for NASA and other federal and state agencies the necessity for develop 
ing standard format(s), the likely consequences of so doing, both for NASA and the 
other groups, and gives the underpinning technical and managerial questions, analysi 
documentation and discussion. 

Topics and Questions Considered at the Workshop 

The topics and questions which were examined by each panel and the Review Panel 
were as follows: 

Individual Contribution from all Attendees 

Comments were requested from all, attendees on the following topics: 

1. Experience of problems in working with NASA CCT's; Summary of procedures 
in geobase systems with which NASA data was Integrated. 

2. Reactions to the suggestion(s} on NASA tape format(s) in each panel. 
Details on the impact on present systems, the costs of converstion, 
and the benefits, if any, from conversion and the use of a future 
standard format or formats. 

Panel Recommendations to NAS A /EROS Data Center on the Following Topics 

1. Recommended format{s) for all future CCT products. 

2. Questions for future working groups to address as a result of the 
above recommendation(s) . 

3. NASA/EDC positions on absorbing pre-processing and reformatting 
costs. 

4. Future software development anticipated or needed and NASA/EDC 
roles in such development. 

Panel Suggestions for Federal and State Agencies 

1. Common formats for future data bases. 



2. Pros and cons of connon format philosophy. 

3. Software transportability. 

Supporting Documentation with Respect to; 

1 . The dimensions of the problem for: 

Large federal data base developers/users 
State agency users 
University researchers 
Private sector users. 

2. Key references to published, unpublished reports, papers, 
memoranda. 

3. Existing transportable software with brief notes on special 
attributes. 

Special Technical Questions which were Examined 

A considerable number of technical questions underpinned the workshop as 
follows: 

1 . Annotation required/desired : 

a. Address! blllty of Individual pixels 

b. Scale factors 

c. Ground Control Points 

d. Tick marks 

2. Considerations on reference systems and projections: 

a. Requirements for latitude and longitude reference 

b. Requirements for Space oblique Mercator, Universal Transverse 
Mercator, Lambert Conformal Conic, or other projections 

x. Requirements for state plane coordinate grids 

d. Requirements for new scales/grids for metrication 

3. Pixel layout on the ground : 

a. Cardinal directions, or in projection grid directions 

b. Spacing considerations 

4. Effects of sampling : 

a. Double sampling 

b. Nyquist sampling 

c. Other sampling strategies 

5. Questions on geobase systems: 

a. Data storage considerations with respect to NASA and other 
formats 

b. Optimal geocoding formats in relation to space images 

c. Format family philosophies 

d. Compatibility considerations for NASA digital Image products 
with respect to census and other major national and state data 
bases 
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6. Effects of present and workshop-reconroended format(s) on : 

a. Radiometric and geometric accuracy 

b. Temporal registration of data 

c. Band to band registration 

d. Precision desired/achievable with respect to latitude/ longitude 
per pixel 

e. Considerations on framing versus continuous Image strips 

7. Various present and future format cost implications : 

a. To NASA 

b. To Federal and State users 

c. To Individual users 

d. To U.S. users In the aggregate 

e. To foreign users and ground stations 

8. Feasibility tests : 

a. Recommendation for experiments with MSS re-sampling 

b. Calibration 

9. Software Portability : 

a. Formats and word lengths for different types of computers 
and their different word lengths 

b. Packing of information from more than one band into single 
words, and its effects on computers with different word lengths, 
and its adaptability to variable numbers of channels 

c. Possibilities for program transportability, when words 
have to be unpacked, and potential for increased use of 
languages like PL/1 or COBOL, which have bit or byte manipu- 
lation features as part of the standard language 

10. Hardware Considerations : 

Formats with respect to expected advances in linear processers, 

array processers, parallel processers, hard wired devices, graphics 

display and retrieval devices, computer architecture, mass storage 

devices, etc. 

Questions on the LAWS AT -D Thematic Mapper 

1 . Should the data from the Thematic Mapper (LANPSAT-D) be ; 

a. Left alone; if so, what header information is desired? 

b. Reprocessed for geometric improvements; if so, into what 
format(s), pixel spacing, and directions? 

2. Contributions on re-sampling : 

a. Does significant(! ) degradation of radiometric quality occur 
as a result of re-sampling? 

b. Do the advantages of re-sampling for many users outweigh losses 
of radiometric quality and, if so, why? 

3. Data Compression of Thematic Mapper : 
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Can we live with data compression from the Thematic Mapper? 

(Some may be necessary because of the order of magnitude Increase 
of the data streams over the Multi spectral Scanner (M.S.S.), 

What are the options and merits of ‘eachV 

a. Compression methods (on board)? 

b. Sampled (alternate rows and columns)? 

c. Cther methods? 

4. How should the data from the Thematic Mapper be structured, and 
why? 

a. Band sequential? 

b. Band Interleaved? 

c. Fully sequential? 

5. What options should be examined for the physical structure of 
tape delivered to users? 

a. CCT, 1600 or 6250 bits per inch? 

b. CCT plus HDDT (High Density Digital Tape* greater than 6250 bpl )? 

c. HDDT (High Density Digital Tape)? 

d. Other? 

6. Hardware Advances : 

What hardware advances are relevant in the next five years which 
impact on questions 3.* 4.* and 5.* above? 

In order to set the workshop recommendations in an appropriate context* 
the material of this report is organized In the following manner. First, some im- 
portant characteristics of geographical information systems are presented, using a 
number of well-known systems employed in the U.S. Next, the incorporation of digital 
remote-sensing data into such systems Is examined* along with the development of geo- 
base information systems which are an outgrowth of digital image processing systems. 

Following this material on information systems the present situation with re- 
spect to existing digital remote sensing data Is reviewed. Present products, data 
handling techniques, supplier processing and user processing are briefly described 
and discussed. 

With both the preceding sections as background, the final section of the report 
gives the recommendations from the workshop, along with supporting discussions and 
analysis. The Executive Summary at the beginning of this report contains the 
essence of the three sections. 


ORIGINAL PAGE IS 
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GEOGRAPHICAL INFORMATION SYSTEMS 


SOME IMPORTANT CHARACTERISTICS 

A great deal of environmental data are currently available to 
decision makers, the increasing trend Is to Incorporate these data In a 
computerized geographically oriented information system. This Is being 
dene at the national level, at the state level, at the county level, at the 
municipal level, and by private firms. There are 50 states, nearly 1,200 
counties, oyc»r 50,000 municipalities; and each of these contains many 
agencies and bureaus. The aggregate Investment In these systems can thus 
be seen to be very large. 

The components of a geographical information system include the 
conversion of the data to a machine readable form, structuring of the data 
In a manner capable of being processed, and the preparation of computer 
programs for retrieval, analysis, and/or display of the data. Each of 
these components Is itself complex and extensive literature exists on each 
of these topics. Overviews are presented in Barraclough (1971), Calkins 
(1977), Gardner (1972), Hearle and Mason (1963), and Tomlinson (1973). 

The locative aspects of the data—where the phenomena of concern 
a,re located on the earth--i$ the major part of any geographic information. 
Thus, a great deal of environmental data Is converted to machine readable 
form by being copied from maps. Point phenomena are easily represented by 
geographical coordinates. These coordinates are names attached as labels 
to places on the earth’s surface. Since the turning of places Is easy, one 
single place may have several different names (or aliases) in different 
coordinate systems. Two well known systems are latitude/longitude, and 
Transverse Mercator coordinates. But a house, for example, also has an 
Identification using a street name and number, at least in the European 
culture area. Conversion between these various aliases may be necessary. 
Early computerized urban transportation studies coded as many as sixteen 
geographical aliases for each location o,f interest. Technically the inter- 
conversions between the alternate names which may be given to a single place 
are relatively simple. Unfortunately, the user community is often not 
familiar with these technical details. Good reviews covering these topics 
include Clayton (1971), Gale (1969), Maling (1973), Moyer and Fisher (1973), 


and Werner (1974). 

A number of additional difficulties arise because of this problem 
of aliases. Although the conversions are generally simple conceptually* the 
actual computations may become quite extensive when the data volumes are 
large, as is typically the case In geographical Information systems. Be- 
cause the earth Is round, and maps are flat. It Is necessary to use map 
projections. Map projection coordinates are thus Introduced as an additional 
family of aliases. These, also, are not technically difficult but are 
largely unfamiliar to the user community, and therefore cause a great deal of 
confusion. The Transverse Mercator system, for example, 1$ used both for 
U.S.G.S. 1:250,000 maps and as a system of state plane coordinates, In 
slightly different variants. The Lambert Conformal Conic projection Is 
used for a different set of maps (aeronautical charts) and also for state 
plane coordinates, also In slightly different variants. State plane co- 
ordinates have a legal status for property description in many states and 
are in one sense thus "official, place names". The projection problem can 
only be avoided by using latitude and longitude coordinates, and these are 

i 

generally the place names used for points in International affairs. 

For local— municipal , county, and sometimes state— activities the 
latitude and longitude system is usually too awkward. But there Is in- 
evitably an area of interest which lies on, or overlaps, the border between 
two state plan coordinate systems, or two UTM zones. The natural tendency 
is to replace these two systems with one new system, and thus another alias 
is created. Where the problem really becomes vexing is when one obtains 
data from two different sources. These sources could be two different maps, 
or two different agencies. With distressing frequency the geographical 
names given to the same locations by these two sources will be different. 

The merging of geographical files thus becomes frustrating and troublesome 
if the relationship between the various aliases is not understood or not 
known, and expensive in any event. Many of the recommendations made In this 
report attempt to overcome some of these frustrations and expenses. 

Geographical phenomena can also be structured in various ways, and 
this has important implications for the types of problems which are most 
efficiently attacked (Peucker and Chrisman (1975), Schmidt (1978)). The 
two Important structures which have to date been associated with geographical 
Information storage are known as the "topological" and the "grid" systems. 
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The topological structure is based on concepts from graph theory, and is 
most closely associated with the coding of bounded regions, called polygons, 
such as a soil patch, a census tract, a planning district, a watershed, etc. 
The most developed form of this scheme, known as Dual Independent Hap 
Encoding (DIME), has been used by the census bureau to encode the entire 
street system for each of the 200 largest metropolitan areas in the U.S. 

In this scheme each street segment is treated as a directed line, with 
coordinates at the end points, and is associated with an area on the left 
and an area on the right hand side, and with a street name and with a range 
of house numbers. One advantage of this scheme is the quasi -automatic 
topological error checking which it permits. Host, recently hierarchical 
methods and methods permitting nesting of areal phenomena have been described. 
The hierarchical schemes allow upward aggregation. The nested schemes 
recognize, for example, that a tree may be in a park in a city in.... These 
are probably the most realistic but are not yet widely implemented. All 
i/i these schemes describe areas as polygons with a finite number of vertices. 
They generally allow disjoint pieces and inclusions. They differ mainly in 
how they arrange the pointers between nodes, arcs, segments, edges, areas and 
adjacencies. Mary workable schemes are now available, for example, to produce 
maps automatically using the two meters of magnetic tape required to store 
the 46,142 point latitude/longitude description of the U.S. counties. Point- 
in-polygon routines are commonplace for assigning point phenomena to regions; 
e.g. , converting street address to census tract name. Polygon overlay pro- 
grams are also available to convert land use polygons into areal counts by 
census tract polygons. The cost of this file merging procedure usually 
grows as the product of the number of polygons, and is also a function of 
their complexity. It is thus not an exponential problem but is sufficiently 
expensive to cause practical difficulties. Regional planning often consists 
of a concatenation of simple binary decisions which can be caricatured in 
the following manner: "If land slope is not steep and drainage is good and 

a road is nearby. . .then development is permitted." Here the logical inter- 
section of three maps (sets of polygons) is required, and a practical prob- 
lem might require several dozen such, often repeated as the criterion shift 
slightly in the process of political compromise. Thus polygon overlay is 
expensive in most present computer environments, although further develop- 
ment of algorithms and data structures promises to simplify polygon manipula- 
tion. 
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In the grid schemes one replaces the polygons by a set of uniform 
regions, triangles or unit squares. This can be done In either of two ways, 
though the distinction Is rarely noticed In practice. In the first Instance 
one samples at a lattice of points; alternatively one aggregates data within 
the cells of a mesh. Hie advantage of a grid system Is that the adjacency 
relation Is constant, a form of spatial statlonarlty which makes analysis 
much easier. It also lends Itself to representation In a computer. A dis- 
advantage Is that, for a given average resolution, the fit to the data Is 
poorer. It Is obviously an awkward way of storing a road pattern. The choice 
of an appropriate grid size Is difficult, but bears a relation to storage 
requirements. Curiously, variable sized trl angulations, the natural simplex 
In two dimensions, and much used In finite element analysis, have not been 
popular as cells for geographical data storage, probably because automatic 
scanners cannot easily capture the data In this form. Interconversion be- 
tween grids using different lattice sizes requires knowledge of the limita- 
tions Imposed by the sampling theorem, but otherwise causes no great difficulties. 

Computer programs exist to convert between polygonal and grid data for- 
mats, subject to the same sampling limitations as above. As long as the internal 
geographical representation is "complete" one can convert from one structure 
to another. Thus, from one point of view. It does not matter which structure 
is used. The two difficulties are that (a) it Is not entirely clear when a 
geographical data set Is "complete", and (b) when the volume of observations 
is large it may be impractically expensive to convert from one organization 
to another. In merging data given in the two structures the relevant con- 
siderations are (a) which Is less expensive to convert, and (b) which provides 
the greater flexibility for further processing given the alms of the agency. 

The availability of software for spatial data handling has recently been 
reviewed by Marble and others (1977). 

In order to consider how digital remote-sensing data might be 
incorporated into existing geographical information systems we can draw on a 
detailed study prepared by the International Geographical Union Commission on 
Geographical Data Sensing and Processing (Tomlinson, 1976). This report 
reviews the San Diego Comprehensive Planning Organization system (CPO), the 
Minnesota Land Management Information System (MLMIS), the New York Land Use 
and Natural Resource Information System (LUNR), the Oak Ridge Regional 
Modelling Information System (ORRMIS) and a large system In Canada. The 
report, as published by UNESCO In 1976 does not, of course, reflect the 
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exact current status of each system. 

The San Diego COP System records soil maps, land use maps, and 
traffic zone boundaries in polygon form using state plane coordinates. It 
has been used to report on land use by traffic zone as projected by an 
analytical urban growth model , Its use for a study of the impacts of a 
rapid transit system is proposed. The land use within noise contours of an 
airport and within similar contours of proposed airports, has been studied. 

A pilot study for flood control has been completed, using the intersection 
of land use and flood plain polygon data sets. Conversion to a 5,000 foot 
0,524 meter) grid was used to convert traffic zone data for use in a pol- 
lutant dispersion model. The basic land use maps were in part developed 
from aerial photographs, and this was considered less expensive than having 
county agencies maintain an ongoing inventory from public construction 
records. 

The Minnesota (MLMIS) system uses as its basic recording unit the 40 
acre (16 ha) fractional section from the public land survey system. The land 
use, water orientation, and ownership status of the territory of each cell is 
recorded. The land use data were obtained primarily through the interpreta- 
tion of aerial photographs. The variables are referenced by county, by minor 
civil division, by latitude and longitude, by section, township, range, and 
by government lot number. Provision is made to allow entry by 2, 5, or 10 
acre units. At the minor civil division level the data are compatible with 
the Crop and Livestock Reporting Service data and U.S. Census Bureau popula- 
tion data. An application of an analytical model has resulted in the pro- 
duction of a map of the 40-acre average market value of buildings and land, 
and related maps and tables. 

The New York (LUNR) system is the computerized record of an aerial 

survey of the state's land resources, supported by retrieval, analysis, and 

* 

display computer programs. For each one kilometer (0.6 mile) square the 
system records 134 variable, about half of them numerical. Some 140,000 
cells are required for the entire state, each indexed by rectangular 
coordinates, by county, by minor divisions, and by watershed. Some data 
are also recorded by 0.4 hectare (1 acre) cells for the entire state. Data 
can be retrieved from this system by arbitrary polygon (using coordinates) 
cell, or by county, or watershed. The system has been used for public 
planning and site evaluations. 
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The Oak Ridge National Laboratory (ORRMIS) system Is coupled to a 
regional environmental systems model of socio-economic » land-use» ecological* 
and socio-political activities. Approximately 80% of the Input data are In 
the form of maps. Geographically* a nested hierarchy of cells Is used, the 
smallest cell being 3.75 seconds of latitude and longitude (circa 10 meters), 
and the largest Is 7.5 minutes In the same coordinates. This grid is de- 
signed so that the modelling effort can be approached at the resolution 
appropriate to the need or data availability. 

This brief review of four installations is hardly exhaustive of the 
thousands of such systems now In operation or under construction. Our 
conclusion, however, is that digital remote sensing data should be useable 
in all of these systems, but only if it is compatible with the diversity 
of these organizations. To this extent, we believe these to be representa- 
tive descriptions. 

INCORPORATION OF DIGITAL REMOTE SENSING DATA INTO GEOGRAPHICAL 
INFORMATION SYSTEMS 

To merge remotely sensed data with one of these systems requires that 
they both reference the geography in the same way. The pixels of a digital image 
must somehow be made compatible with the remaining Information In the data base. 
Each frame, as currently constituted, of a digital Landsat image contains a very 
large number of pixels, so large in fact that It may quickly overload the capa- 
bility of the user agency. The cost of converting all of these pixels to fit the 
user's system may simply be too large. An intriguing notion is that it may be 
cheaper to convert all of the user's information to fit the image, leading to what 
has been referred to as an Image Based Information System (Bryant and Zobrist, 
1976), That this is on occasion feasible is because a polygon file invariably con- 
tains fewer bits than does the image. The currently available County Boundary DIME 
File for the entire United States, for example, contains only slightly over 50,000 
latitude/longitute pairs, that is, 10 numbers and this is considerably less than 
the number of pixels (3 x 10^) in a single Landsat image. Psychologically, how- 
ever, it may be difficult for the user to abandon his existing data structure just 
to make all his data fit a simple image. The user views the remotely sensed data 
as an addition to existing maps and geographically coded files. The pixel arrange- 
ment on subsequent satellite passes will of course also be slightly different geo- 
graphically and will require another conversion. It would be misleading to argue 
that converting a geographically based Information system to an image based system 
is always appropriate, or that It is always inappropriate. If several agencies use 
one image, then either each Incoming image must be converted to fit each data base 
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or the several data bases must all be converted to fit each new Image. 

This emphasizes the utility of multi temporal Imagery which has already 
been mutually registered by some central agency (such as EOC). The interconver- 
sfons between Image and other data structures would thus be standard for each 
data base system, rather than having to be rederived on an Image- by- Image basis. 
However, image overlay Invariably requires resampling of the radiometric values, 
which is still a point of concern In the user community with regard to its effects 
on classification accuracy and Intra-pixel mixture estimation. 

For users willing to accept resampled Imagery, the sheer volume of data 
contained In a single digital satellite Image remains a problem. This will 
probably result in image reformatting being one of the single greatest data 
processing effort associated with any geobase system attempting to Incorporate 
remote sensing data. 

Greater acceptance of the data by the user community will surely be en- 
hanced if the data are available in a form which most nearly corresponds to ex- 
isting data structures. Although geobase Information systems are still In their 
infancy, and their existing data structures may be far from optimal, the fact re- 
mains that they represent a tremendous aggregate investment of time and effort, 
and< must now participate In operational decision-making processes before they can 
further evolve. To the extent that remote sensing data requires either extensive 
pre-processing, or complete redesign of existing data structures, Its incorpora- 
tion into operational systems will be hindered. It thus behooves the suppliers 
of digital remote sensing data to determine which formats and levels of processing 
maximize the utility of their product to the greatest number of users by minimiz- 
ing the “front-end" costs associated with data reformatting. 


FOOTNOTE: 


1. Even if the latitude/longitude system were used for the recording of 
imaged pixels a resampling would have to be done If the samples are to be evenly 
spaced, since there are fluctuations in the rate of travel of the scanning mirror, 
or else each pixel would have to have attached to it an Individual latitude and 
longitude value. 
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EXISTING DIGITAL REMOTE SENSING DATA 


PRESENT AND PROPOSED PRODUCTS 

Currently a variety of satellite data products are available to users. The 
availability of these products is rapidly increasing as new and more sophisticated 
satellites become operational. Because of this users are many times unaware of 
the total scope and variety of satellite data products that are available now or 
will be in the near future. The following is a brief discussion of the avail- 
ability of data products from some present and future environmental satellites. 
(For a more comprehensive survey on this topic see Hughes, C.L., and Campbell, 

D.C. (1977).) 

Basically there are three types of environmental satellites: 1) earth resource 
satellites, 2) meteorological and oceanographic satellites, and 3) manned satel- 
lites. These operate in a variety of different orbit configurations. Meteoro- 
logical and oceanographic satellites offer a high temporal resolution and low 
spatial resolution and correspondingly low data rates. Earth resource satellites 
generally offer low temporal resolution (Landsat repeat coverage is 18 days), 
high spatial resolution, and high data rates. Most satellites operate in sun- 
synchronous or geosynchronous orbits except for those whose sensors are primarily 
microwave (e.g., Seasat-A) because microwave does not require sunlight for opera- 
tion. Several of the more applications-oriented satellites and their data pro- 
ducts are described below. 

LANDSAT 

The most widely used earth resource satellite is Landsat. Landsat 1 was 
launched in 1972 and ceased operation in January 1978. Landsat 2, launched in 
January 1975 and Landsat 3, launched in March 1978, are currently operational. 
Together, the Lands ats provide coverage at nine day intervals. The primary 
sensor on Landsat has been a MSS (Multi spectral Scanner) with four bands in the 
visible and near infrared. Landsat 3 has a thermal channel as well which should 
become operational in May 1978. 

Landsat D, to be launched in the early 1980’s, will have an MSS similar to 
Landsat 1, 2, and 3, and will carry a Thematic Mapper (TM). The TM will have 
seven bands; four visible, two infrared, and one thermal. For a complete dis- 
cussion of Landsat satellites see Landsat Data Users Handbook (1976), Doyle 
(1978) and Lynch (1976a, 1976b). 

Landsat data is available through EROS (Earth Resources Observations System) 
Data Center (EDC) in Sioux Fallx, South Dakota. EDO is operated by the U.S. 
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Geological Survey and works cooperatively with the National Cartographic Information 
Center (NCIC). Orders for data can be placed directly to EDC: 


User Services 

EROS Data Center 

Sioux Falls* South Dakota 57198 

Phone: (605) 594-6511 XI 51 

FTS 784-7511 

or by computer terminal link from the following regional field offices: 


/opographic Office 
U.S. Geological Survey 
900 Pine Street 
Rolla, MO 65401 
Phone: (314) 364-3680 

Hours: 7:45 - 4:15 

NCIC Information Unit 
National Center - Stop 507 
12201 Sunrise Valley Drive 
Reston » VA 22092 
Phone: (703) 860-6045 

Hours: 7:45 - 4:15 


Air Photo Sales 
U.S. Geological Survey 
Federal Center. Building #25 
Denver. CO 80225 
Phone: (303) 234-2326 
Hours: 7:45 « 4:15 

Map and Air Photo Sales 
U.S. Geological Survey 
345 Mlddlefleld Road 
Menlo Park. CA 94025 
Phone: (415) 323-2157 

Hours: 7:45 - 8:15 


There are several EROS Applications Assistance Facilities which maintain micro- 
film copies of data archived at the Center and provide computer terminal inquiry 
and order capability to the central computer complex at the EROS Data Center. 
Scientific personnel are available to assist in ordering. 

The addresses are: 


. EROS Applications Branch 
U.S. Geological Survey 
Room 202, Building 3 
345 Mlddlefleld Road 
Menlo Park, California 94025 
Phone: (415) 323-8111 

Hours: 8:00 - 4:15 

EROS Applications Assistance 
Facility 

EROS Data Center 

U.S. Geological Survey 

Sioux Falls, South Dakota 57198 

Phone: (605) 594-6511 

Hours: 8:00 - 4:30 


EROS Applications Assistance 
Facility 

HQ Inter American Geodetic Survey 
Headquarters Building 
Drawer 934 

Fort Clayton, Canal Zone 
Phone: 83-3897 
Hours: 7:00 - 3:45 

EROS Applications Assistance 
Facility 

U.S. Geological Survey 
Suite 1880 
Valley Bank Center 
Phoenix, Arizona 85073 
Phone: (602) 261-3188 

Hours: 8:00 - 5:00 



U.S. Geological Survey 
Room B-207-A, Building 1100 
National Space Technology 
Laboratories 

Bay St. Louis, Mississippi 39520 
Phone: (601) 688-3541 
Hours: 8:00 - 4:30 


EROS Applications Assistance 
Facility 

University of Alaska 
Geophysical Institute 
College, Alaska 99701 (Fairbanks) 

Phone: (907) 470-7558 

Hours: 8:00 - 5:00 

In addition, several EROS Data Reference Files have been established throughout 
the United States which maintain microfilm copies of data available from the EROS 
Data Center. However, they do not have personnel for consultation. The addresses, 
telephone numbers and hours of operation are as follows: 


EROS Applications Assistance 
Facility 

U.S. Geological Survey 
1925 Newton Square East 
Reston, Virginia 22090 
Phone: (703) 860-7871 

FTS: (900) 860-7871 

Hours: 8:00 - 4:15 


EROS Data Reference File 
Public Inquiries Office 
U.S. Geological Survey 
108 Skyline Building 
508 Second Avenue 
Anchorage, Alaska 99501 
Phone: (907) 277-0577 

Hours: 9:00 - 5:30 

EROS Data Reference File 
Public Inquiries Office 
U.S. Geological Survey 
Room 7638, Federal Building 
300 North Los Angeles Street 
Los Angeles, California 90012 
Phone: (213) 688-2850 

Hours: 9:30 - 4:00 

EROS Data Reference File 
Water Resources Division 
U.S. Geological Survey 
Room 343, Post Office and Court 
House Building 
Albany, New York 12201 
Phone: (518) 474-3107 or 6042 

Hours: 8:00 - 4:30 

EROS Data Reference File 
Bureau of Land Management 
729 NE Oregon Street 
Portland, Oregon 97208 
Phone: (503) 234-3361 X400 

Hours: 8:00 - 4:00 


EROS Data Reference File 
Topographic Office 
U.S. Geological Survey 
900 Pine Street 
Rolla, Missouri 65401 
Phone: (314) 364-3680 

Hours: 8:00 - 5:00 


EROS Data Reference File 
Water Resources Division 
U.S. Geological Survey 
975 West Third Avenue 
Columbus, Ohio 43212 
Phone: (614) 469-5553 

Hours: 8:00 - 4:30 


EROS Data Reference File 
University of Hawaii 
Department of Geography 
Room 313C, Physical Science 
Building 

Honolulu, Hawaii 96825 
Phone: (808) 9-4-8643 

Hours: 8:00 - 4:00 

EROS Data Reference File 
Maps and Surveys Branch 
Tennessee Valley Authority 
20 Haney Building 
311 Broad Street 
Chattanooga, Tennessee 37401 
Phone: (615) 755-2149 

Hours: 8:00 - 4:00 
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In addition to the regular Landsat Facilities, the NOAA has 16 m browse files of 
Landsat data. The reason for this service is that Landsat data is often used in 
conjunction with NOAA satellite data. There are 21 NOAA browse files; however 
th£se files will be terminated in October 1978. 


University of Alaska 
Arctic Environmental Information 
and Data Center 
142 East Third Avenue 
Ahchorage, Alaska 99501 
Phone: (907) 279-4523 

Inter-American Tropical Tuna 
Coimisslon 

Scripps Institute of Oceanography 
Post Office Box 109 
LaJolla, California 92037 
Phone: (714) 453-2820 

National Geophysical and Solar 
Terrestrial Data Center 
Solid Earth Data Service Branch 
Boulder, Colorado 80302 
Phone: (303) 499-1000 X 6915 

National Oceanographic Data Center 
Environmental Data Service 
2001 Wisconsin Avenue 
Washington, D.C. 20235 
Phone: (202) 634-7510 

Atlantic Oceanographic and 
Meteorological Laboratories 
15 Rickenbacker Causeway, 

Virginia Key 
Miami, Florida 33149 
Phone: (305) 361-3361 

National Weather Service, Pacific 
Region 

Bethel -Pauaha Building, WFP 3 
1149 Bethel Street 
Honolulu, Hawaii 96811 
Phone: (808) 841-5028 

National Ocean Survey - C3415 
Building #1, Room 526 
6001 Executive Boulevard 
Rockville, Maryland 20852 
Phone: (301) 496-8601 


Atmospheric Sciences Library - D821 
Gramax Building, Room 526 
8060 - 13th Street 
Silver Spring, Maryland 20910 
Phone: (301) 427-7800 


Lake Survey Center - CLxl3 
630 Federal Building & U.S. Court- 
house 

Detroit, Michigan 48226 
Phone: (313) 225-6126 


National Weather Service, Central 
Region 

601 East 12th Street 
Kansas City, Missouri 64106 
Phone: (816) 374-5672 


National Weather Service, Eastern 
Region 

585 Stewart Avenue 
Garden City, New York 11530 
Phone: (516) 248-2105 

National Climatic Center 
Federal Building 
Asheville, North Carolina 28801 
Phone: (704) 258-2850 X620 


National Severe Storms Lab 
1313 Halley Circle 
Normal, Oklahoma 73069 
Phone: (405) 329-0388 


Remote Sensing Center 
Texas A & M University 
College Station, Texas 77843 
Phone: (713) 845-5422 


National Weather Service, Southern 
Region 

819 Taylor Street 
Fort Worth, Texas 76102 
Phone: (817) 334-2671 


National Weather Service, Western 
Region 

125 South State Street 
Salt Lake City, Utah 84111 
Phone: (801) 524-5131 



Service 

Environmental Sciences Group 
Suitland, Maryland 20233 
Phone: (301) 673-5981 


439 West York Street 
Norfolk, Virginia 23510 
Phone: (804) 441-6201 


Northeast Fisheries Center 
Post Office Box 6 
Woods Hole, Massachusetts 02543 
Phone: (617) 548-5123 

University of Wisconsin 
Office of Sea Grant 
610 North Walnut Street 
Madison, Wisconsin 53705 
Phone: (608) 263-4836 


Northwest Marine Fisheries Center 
2725 Montlake Boulevard East 
Seattle, Washington 98112 
Phone: (206) 442-4760 

National Climatic Center 
Satellite Data Services Branch 
Room 606, World Weather Building 
Washington, D.C. 20233 
Phone: (301) 763-8111 


Digital data are available from EDC as CCT's (Computer Compatible Tapes) in 
the standard NASA format. This format is soon to change when EDC begins to im- 
plement the EROS Digital Image Processing System (EDIPS), which is discussed in 
depth in the next section on data handling techniques. 

If raw data is required on a regular basis it may be obtained from: 

IPF Support Services 
Code 563 

Building 23, Room E409 
NASA Goddard Space Flight Center 
Greenbelt, Maryland 20771 
Phone: (301) 982-5406 


NOAA Satellite Data 

NOAA data is used primarily for meteorological and oceanographic studies and 
are of a much coarser spatial resolution than Landsat data. However, because 
these data have a higher radiometric resolution (8- or 10-bit vs. 6- or 7-bit 
quantization) they are of interest to earth resource monitors. Real time data 
products may be rrdered and sent by mail or user's communication links by contact 
ing: 

Nationel Evnironmental Satellite Service (NESS) 

Director of Operations, FOB-4 
Washington, D.C. 20233 

Retrospective (archived) data products may be obtained from: 

National Climatic Center (NCC) 

Satellite Data Services Branch 
Room 606, World Weather Building 
Washington, D.C. 20233 

Experimental Satellites 

A number of other satellites are now or will be in the next few years pro- 
viding digital data for experimental studies of the earth and its atmosphere. 

Many of these have poorly defined data output products and no mechanism estab- 
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Hshed for data dissemination. Users of these data may be required to have 
verf sophisticated data processing facilities at their disposal. Some of these 
satellites are listed below and references are listed at the end of this section 
for further information on obtaining these data. 

* Seasat-A 

Launch date: mid 1978 

principal sensor: microwave 
application: ocean monitoring 

* GEOS-3 

launch date: April, 1975 

principal sensor: radar altimeter 

application: topographic mapping with extreme accuracy 

* AEM-A; (HCMM) 

launch date: mid 1978 

principal sensor: Heat Capacity Mapping Radiometer 
application: continuous sensing of surface thermal character- 
istics 

* LAGEOS 

launch date: May, 1976 

principal sensor: none; passive laser reflection 

applications: accurate determination of continental drift, 

plate tectonics, and distortions of the earth surface 

* SEOS-A 

launch date: mid 1980's 

principal sensor: LEST (Large Earth Survey Telescope) 
application: high resolutuion continuous coverage of earth re- 

sources subjects 

* SHUTTLE (manned vehicle which can fly into satellite o»bit) 

launch date: 1979 or 1980 

principal sensors: undetermined 
applications: unlimited 
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DATA HANDLING TECHNIQUES FOR EXISTING DIGITAL REMOTE SENSING 


DATA 

CURRENTLY AVAILABLE DATA 

This discussion will be limited to Landsat data, which was the paramount 
concern of the conference. The basic references for this section are Thomas, 

1975, and NASA, 1976. 

A Landsat scene in digital form consists of 2340 scan lines. The nominal 
number of pixels per scan line is 3240, which, based on a scene size of 185 by 
185 km, yields a nominal pixel size of 79 by 57 m (about 1.12 acres). In practice, 
the line length may vary from 3000 to 3450 pixels. The line length on the CCTs is 
adjusted to a multiple of 24 by systematically repeating selected pixels in lines 
which fall short of the adjusted line length (LLA). 

The logical structure of the Landsat digital data is band interleaved by 
pixel pair. The fundamental unit of this structure is an eight-byte group, which 
consists of band 4 radiance numbers (RNs) for two adjacent pixels, followed by 
RN pairs for bands 5, 6, 7. All RNs are represented as eight-bit bytes, right- 
justified, with either 6 or 7 bit quantization. 

Landsat data are stored on the CCT as four files, each file representing a 
quarter-width vertical strip of ? single scene. In order to reconstruct scan 
line N in its entirety, one therefore must extract and concatenate record N + 2 
from each of the four files. The '+ 2 1 allows for the I.D. and annotation record. 
These records are the first and second entries, respectively, in each strip file, 
and contain the following Information: 

ID record: scene/frame ID 

file sequence number 
data record length 
data mode/correction code 
adjusted line length 

annotation record: Greenwich date of exposure 

format center 1 at/1 on 

nadir lat/lon , 

sun elevation and azimuth 

satellite heading 

orbit number 

data acquisition site 

In addition, the last 56 bytes of each scan line contain sensor radiometric 
calibration coefficients and sample calibration levels. 

The physical structure of the CCT may be specified as either 7 track, 800 bpi, 
or 9 track, 800 or 1600 bpi. At 1600 bpi density, all four files are placed on a 
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single 2400 foot tape. At 800 bpi, 2 tapes are required. On all tapes or tape 
sets there is a fifth file composed of data from the Special Image Annotation 
Tape (SIAT), containing information on spacecraft attitude and additional sensor 
performance data. 

SUPPLIER PROCESSING OF LANDSAT DIGITAL DATA 

Landsat MSS data enters the NASA Image Processing Facility (IPF) in the 
digital domain as wideband video tapes. The first operation performed oh the 
data is reformatting to high-density digital tapes (HDDTs). These HDDTs are then 
input to a subsystem which performs radiometric calibration, decompression if 
bands 4, 5 and 6 have been compressed, and a second reformatting to the CCT dis- 
tribution format. The system as currently configured (1977) applies geometric 
corrections only to photographic imagery, and these in only the most general sense. 

The NASA IPF is currently being supplanted by the Master Data Processor (MDP) 
as the EROS Digital Image Processing System (EDIPS) nears completion. MSS video 
tapes entering the MDP will still be converted to HDDTs, but no photographic pro- 
ducts will be generated. All subsequent processing will take place in the digital 
domain. The HDDTs will be radiometrically corrected and will have geometric cor- 
rection coefficients written on the header. Corrected HDDTs will also be output 
which have been geometrically rectified to either a UTM, Polar Stereographic, or 
SOM projection. Resampling will be performed with either a cubic convolution or 
nearest neighbor algorithm. 

NASA will retain only a limited "quick-look" capability for image generation 
or CCT production. Under the new system configuration, all Landsat data will 
leave NASA in HDDT form, for product generation by the EDIPS system at EDC. EDIPS 
is expected to become operational by mid-1978 (EROS Data Center, 1978), and 
will be the user community's primary source of Landsat digital data. EDIPS' pri- 
mary digital product will be CCTs generated from fully corrected HDDTs, using the 
SOM (Space Oblique Mercator) projection and cubic convolution resampling. The 
logical structure will be either band-sequential or band-interleaved-by-line. 

Other projections (UTM, Polar Stereographic), resampling algorithms (nearest 
neighbor), and unprocessed ("archival") tapes will be available only on a special- 
order basis. 

Various types of enhancement will be applied to digital data products in the 
EDIPS system (TRW, 1977). Default processing options include haze removal by sub- 
traction of a scene-dependent haze bias, and contrast stretching according to a 
histogram equalization. In addition, users may request an RN histogram of each 
scene, and edge-enhancement by a Laplacian algorithm. 
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USER PROCESSING OF LANDSAT DIGITAL DATA 


Currently, users are forced to perform most data corrections themselves. 
Examples follow of two systems which have been developed, at quite different 
scales, within the user community. 

IBM has developed a large scale Image Processing Facility (IPF) for the 
manipulation of Landsat data (Bernstein and Ferneyhough, 1975). Input data is 
radiometrically calibrated using the NASA procedure, then subjected to 'supple- 
mental' calibration which removes striping by statistically equalizing the de- 
tector output. Geometric correction of Landsat data is accomplished by selecting 
ground control points in the image and using these points to evaluate coefficients 
of spacecraft attitude functions. These functions are evaluated for a selected 
number of points in the image, yielding an 'interpolation grid' from which pixels 
for the rest of the corrected image may be located by bilinear interpolation. 

Once new pixel locations are established, their radiance numbers are assigned by 
one of several resampling algorithms. The rectification process is made more ac- 
curate by the a priori specification of systematic sensor errors, which are re- 
moved by direct modeling. These include distortions such as mirror velocity 
nonl inearity, line perspective, scan skew, and earth rotation. 

A smaller-scale system with similar objectives has been implemented at the 
U.S. Army Engineer Waterways Experiment Station (Williamson, 1977). The con- 
straint upon this system was a small (16K) computer. Data are corrected for 
pixel aspect ratio by repeating every third and twentieth scan line. Earth 
rotation distortions are compensated by offsetting every 12 scan lines one pixel 
westward. These systematic distortions are the most visible and are therefore 
the only ones explicitly corrected. The remainder of the geometric distortions 
are accounted for by translation, rotation, and affine "stretching" equations 
using coefficients derived from regression upon ground control points. These 
equations are evaluated for each point in the corrected image to yield an address 
on the original image, to the nearest whole pixel. This avoids radiometric 
Interpolation but results in a loss in positional accuracy. The overall tech- 
nique described is certainly less accurate than the IBM system, but is signifi- 
cantly less costly to implement and operate. 

The two systems described above represent a cross-section of user responses 
to the Landsat data correction problem. Part of this problem will be alleviated 
by the implementation of ED I PS, but there are other difficulties to be overcome 
before the georeferenced use of Landsat data can become more widespread. 
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GENERAL CONSIDERATIONS OH COMMON FORMATS FOR 


GEOBASE INFORMATION SYSTEMS AND SATELLITE IMAGE PRODUCTS 


There are currently a large number of geobase information systems in exist- 
ence, and an even larger number under development or under consideration. The 
diverse purposes for which these are intended, and the incompatibilities between 
thdm in their original data, objectives, and analytical methods, seems to make 
it infeasible to find some common format philosophy or philosophies for geo- 
graphic information systems. Within major federal agencies it is known that many 
.data bases are being developed without knowledge of those already existing in the 
agdncy. Some 50 major geographically-based systems, all essentially independent, 
exist in the U.S. Geological Survey alone. 

The inclusion of satellite data into these information systems further com- 
plicates the problem. How many of these should or could attempt to incorporate 
satellite image data is unknown. Despite these complexities, the search for 
standardization on some acceptable numbers of formats should proceed. This is 
both a general federal agency and state agency responsibility, since information 
systems development with geographically coded data is common to both parties. 

The complexity of this area is such that there are no easy solutions and no 
simple recommendations. The following discussion reflects both the difficulty 
of the problems and the diversity of opinions among those who attended the con- 
ference . 

* A procedure for windowing and reformatting Landsat so that is could 
directly overlay USGS map quadrangles would increase the utility of 
remote sensing data in geobase information systems. If this procedure 
is initiated it will result in the attraction of new users, reduced 
handling and processing time for all users, and an overall cost savings. 

* New geographic data bases designed to provide the public with information 
and serve the needs of federal and state agencies should be structured 

to offer maximum flexibility for conversion by external users. Both 
polygon and gridded approaches to a data base structure must continue to 
be evaluated. Both types may need to be supported by providers of image 
products. 

* A workshop bringing together representatives from local, county, regional 
state and national levels of government is recommended. The workshop 
should examin both the recommendations given in this report, and the 
development of upwardly compatible geobase information systems between 
hierarchies of government. 

* At the state level a central agency should examine a common format phi- 
losophy in respect to its impact on the geographic information needs of 
the state. A formal procedure should be developed to support implemen- 
tation of a corranon format at all agency levels. 
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Mew Formats for Digital Image Products 


It is recognized that digital image products must serve the needs of two 
very different segments of the user and potential user community. The first is 
the very sophisticated research or industrial user; the second is the typical 
state, regional or county planning or resource group, which even collectively 
still lacks the resources to work with the raw digital image data. It was evi- 
dent at the workshop that while these two types of users require different data 
products, that both would benefit from the availability of smaller, more specific 
image segments because: 

1. Data storage, manipulation, processing, and subsequent information ac- 
cessing would be simplified because the product would typically include 
far fewer pixels than the currently used "frame" 

2. The potential of mini-computer use for digital image processing could 
be more fully realized because the size of the data sets to be mani- 
pulated would be reduced. This would serve to attract more potential 
users to both segments of the user community. 

While the fully re-formatted products are aimed at the less sophisticated 
user, it was the confident expectation at the workshop that the merits of such 
products would soon bring all but a handful of researchers to prefer them, and 
to employ them routinely. 

ATTRIBUTES 

The advantages of the fully-reformatted products are as follows: 

1. Resampled pixels would be consistent as to location, thereby facil- 
itating (a) geobase file updating, (b) temporal comparisons of all 
types, (c) establishment of other file-manipulation procedures based 
on a standard location. 

2. Arraying of square pixels in UTM (Universal Transverse Mecator) 
blocks is compatible with the raster formats of image-based geo- 
graphical information systems. 

3. Local users would be upwardly compatible with larger system users 
through development of data transfer based on a common format. 

4. Significant economies of scale would be achieved through central- 
ization of processing, with resultant overall system cost savings. 

5. New users will be attracted. 

6. There will be a notable reduction in the handling and processing 
time required of users themselves, to say nothing of the simpli- 
fication of many software development problems. 

The timeliness of delivery of such products is of continuing concern to 
users. It is important to ensure that these data products be delivered to users 
within 48 hours to 7 days depending on the urgency of the problem. Urgently- 
required materials may carry some premium pricing. 
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Present data delivery time Is both slow and Inconsistent. If an Individual 
program manager is willing and able to use digital Image data as an element in 
his decision-making process* It is most important that the data be delivered with 
in a time frame which will allow ample time for analysis. The present system of 
data delivery gives the user no guarantee In terms of time of acquisition and de- 
livery. 

The decision making processes of federal and State agencies will not tol- 
erate— or be willing to risk their decision alternatives on— a data source which 
is unreliable. The question of operational, and hence guaranteed, data delivery 
must soon be faced. 

* 

State Involvement in Federal Data Supply Decisions 

Present steps for federal -state cooperation in technical criteria, cost 
policy and delivery need to be strengthened and improved. We know that they are 
already in place, but they are just not good enough. 

Possible coordination between NASA-US6S-0MB-0STP and National Council of 
State Legislators, the National Governors Association, and various State Plan- 
ning Agencies should be explored. 

Data Handling Responsibility within Government 

Government agencies at all levels have a fundamental responsibility to be 
able to provide Information systems which will offer the maximum public benefit 
over a long period of time. This is not a responsibility that can be shifted to 
the private sector for any extended period. 

Data gathering agencies that provide data in a form that requires sophisti- 
cated technical manipulation before it is useful in the planning/decision making 
processes of government simply exacerbate the situation. 

Federal agencies must provide basic data sets in a form that can be readily 
used* by state and local government, with the staff, level of competency and avail 
ability of computational machinery that now exists in those governments. 

This step will: 

1. Encourage state and local governments to accept their responsibility 
to staff up for their own data handling operations. 

2. Create a wider body of data users, that will in turn create an expanded 
market for special purpose products that can be supplied by the private 
sector. 

Role of NASA as R & D Agency 

The role of NASA as a research and development agency is not widely recog- 
nized or understood. Data in and of itself have no merit; they are either useful 
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or not useful. To be useful in the government context requires Institutional 
change to adapt to the new data types and possibilities. All Institutional 
change In local and state government requires upheaval, commitment and expend- 
iture of sometimes scarce resources. These changes are taking place In response 
to the data provided by NASA. The changes are regarded locally as permanent - 
with NO understnading that NASA is acting In a research and development mode. 

Regardless of whether the R & D nature of NASA activity has In fact been 
made clear to user governments, an institutional change In local and state govern- 
ments Is not an R & D venture . 

The strategy of long-term data supply must be immediate concern to NASA if 
serious lack of confidence, misuse of funds and poor use of scarce techni-al 
resources is to be avoided. 

Common Data Formats for Diverse Data Bases 

On the matter of formats of data bases it Is important that we define the 
term data base as Interpreted in this discussion. Data bases can exist in both 
physical hard copy forms, e.g, maps film, statistical data, and in computerized 
forms containing representations of these data in digital form. For purpose of 
this discussion, a data base was defined as a computerized or automated assembly 
of data. 

Automated data bases can be divided into three basic application types: 
internal, external and a combination of both. To further elaborate on these ap- 
plication types, internal data bases are those designed and containing data for 
internal federal or state agency applications. An example of this type of data 
base might be NOAA’s National Weather Service data base of weather data which is 
used for weather forecasting purposes. An external data base Is one designed to 
provide the public with information, such as the Department of Interior’s Census 
Statistics data base. The third type of data base is one designed to satisfy 
both internal agency requirements and external user requirements. An example of 
this might be World Data Bank II, developed by the CIA to meet internal carto- 
graphic needs, but now available to external users through NTIS. 

The design of new geographic data bases that will be of the external or 
combination types should be designed to offer maximum flexibility for conversion 
by external users. The workshop members were divided on to whether the point/ 
polygon design sturcture offers this flexibility better as compared to the grided 
approach to a data base sturcture; and both approaches must continue to be eval- 
uated, along with the development of efficient algorithms for conversion between 
them. 
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It Is essential that a standard projection (preferably UTM) be Integrated 
Into the design structure. It was recognized that the standardization of the 
UTM projection represents a substantial cost to several federal agencies (NASA, 
001, EDC), but It was felt that this cost should be Increased once by the federal 
agencies rather than a multitude of times by the many users. There would be an 
overall cost savings because federal agencies finance much of the user processing 
and because data processing for most users would be greatly simplified. Because 
the UTM projection Is the standard for most of North America, digital Image data 
preprocessed with UTM as the standard projection could be easily Integrated or 
overlayed on geographic map data. 

The preceding discussion applies to those data bases which may still be 
changeable or remain to be developed. However, most data base design and develop- 
ment activities preclude such changes once programming has begun. 

Modification of existing national data bases is not likely. Federal data 
base development activities are dictated to a large degree by Federal Information 
Processing Standards (FIPS) and local agency standards and regulations which would 
in most instances limit an agency's ability to make changes, after the fact 

With these considerations in mind, a principal concern expressed at the work- 
shop was on the necessity for Improved communication on such matters between 
agencies and levels of government to avoid freezing even more information sys- 
tems in forms which are not easily compatible with digital Image data. 

The establishment of a new common data format for digital image data thus 
becomes important not merely because It appears most compatible with user needs, 
but also because It can In turn create an environment within which more geobase 
information systems will be designed to be compatible with such a data source. 


RECOMMENDATIONS AND SUPPORTING DISCUSSION 


GENERAL RECOMMENDATIONS FOR FUTURE DIGITAL IMAGE FORMATS 


If satellite data Is to attract a substantially larger user com- 
munity In the future, it must satisfy the needs of applications-orlented 
users as well as those involved in research and development. NASA or EDC 
should be able to provide digital image products which are geared toward 
both types of users. It is with regard to this dichotomy In user demand 
that the following recommendations, are made: 

RECOMMENDATIONS 

Two distinct families of imagery should be available to users on a 
standard product basis: 

* Uncorrected video data, either In completely unaltered ‘raw* 
form, or radiometrically calibrated but not corrected, but with 
substantial header and annotation information. 

* Fully processed data, which has been radiometrically corrected (includ- 
ing destriping) and geometrically rectified to a UTM projection. 

Both product families should include as annotation the geodetic 
coordinates of selected control pixels within the scene which could be/have 
been used to perform a rectification to a geobase reference system. Both 
product families should be accessible by segments corresponding roughly/ 
exactly to 71/2 minute USGS topographic quadrangles. The fully processed 
data should exhibit east-west orientation of the scan lines. Also: 

* Processed data should be available in certain other projections 
and formats as ’special -order' products, for which a somewhat 
longer order turnaround would be acceptable. 


DISCUSSION 

Uncorrected data 

The raw, uncorrected/cal ibrated data is needed by users who wish to 
perform local, precise radiometric measurements. These users generally 
have their own hardware and software for resampling and correcting the 
data. Annotation supplied with uncorrected tapes should contain information 
on sensor electronic and mechanical performance parameters, spacecraft 
attitude, etc., so that radiances can be derived from an "idea!" spacecraft 
in an ideal orbit. Dissemination of raw data will play a significant role in 
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determining the effect of resampling on classification and radiometric 
accuracy because If will, allow the use of different resampling schemes. 

Uncorrected data should be supplied with sufficient information to 
permit sophisticated users to rectify the Imagery without themselves 
acquiring ground control Information. This would probably consist of 
Identifying a minimum number of pixels In the scene In terms of geodetic 
coordinates, and/or highly accurate spacecraft attitude information. 

All users should be able to obtain only those portions of Imagery . 
relevant to their areas of inquiry. Users should not have to purchase and 
manipulate vast quantities of data which are of no use to them. In order 
to extract an Important fraction thereof. To this end NASA should permit 
accession of all types of digital data by a latitude-longitude spatial 
window, subject to some minimum size constraint such as a USGS 71/2 minute 
topographic quadrangle. Virtually all conference participants agreed that 
standardization to 7 1/2 minute quads would be close to an optimum format. 

Corrected data 

To Improve the dissemination of remote sensing data NASA should 
redefine Its user community to Include less sophisticated users. Improved 
data flow to experienced users may actually inhibit flow to first-time users. 

The fully corrected product recommended above should remedy this problem 
by providing, off-the-shelf, data which is in a standard projection, gridded 
in a manner that makes it easily compatible with other digital data bases. 

One r>i the major difficulties in the use of Landsat data has been the 
high processing cost of geometrically correcting the data. This Is a problem 
of both software and hardware that is caused by the mass of data (3 X 10^ bytes) 
in one Landsat scene, and is compounded by the tilt of the spacecraft orbit. 

While rectification software is available, it Is usually designed for larger 
computers and is time consuming to bring up even on these. The general 
feeling of the conference was that the majority of potential new users are 
smaller institutes or state agencies more interested In applications that in 
research and development. As such they have limited computer resources, 
usually consisting of mini-computer systems or access to small time-sharing 
systems. On such systems bulk 1/0 tends to be difficult or expensive. For 
the smaller user the computational cost of converting Landsat data to a usable 
data base format can become prohibitive. Moreover, the level of Information 
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exchange within the user community is currently such that an Institution 
which is primarily applications oriented may incur considerable costs 
developing a Landsat data-correction system* only to discover that much or 
all of an identical system has already been developed elsewhere. This 
duplication of R & D efforts, coupled with the limited portabili > of 
existing software packages, is a powerful limiting factor in the broader 
utilization of Landsat data. 

For Landsat data to be useful it must be in a form compatible with 
other digital data bases. In addition to geometric rectification the data 
should be resampled to an east-west scan direction. This "rotation" is 
costly and difficult to perform on a small computer and should be done 
once by NASA. The recommendation to NASA that the projection be UTM is 
based on the need for a conformal projection which allows Cartesian (x,y) 
referencing with minimal distortion. From user comments the UTH alignment 
appears to be the most common alignment of user-community databases and 
other Government data structures such as the NCIC digital terrain tapes. 
Information lost in resampling to a new grid alignment was generally conceded 
to be insignificant compared to the difficulty of working with data structures 
which are not aligned. Some agencies will be using data referenced in state 
plane coordinates, and it would be useful to know by how much these differ 
from the UTM grid (it may be less than the Landsat sampling interval). 

The corrected imagery should have a much more geographically specific 
accession structure than the uncorrected imagery. The most widely accepted 
proposal for a standard product was a direct overlay of the USGS 7 1/2 
minute topographic quadrangle series, with fixed sample locations to facilitate 
multi temporal analysis. 

Logical formatting of either uncorrected or corrected data should not 
be restrictive to users whose computer systems have limited 1/0 buffering or 
high 1/0 costs, nor should it discriminate with respect to users interested 
primarily in multispectral classification, image generation, or database 
augmentation. Since no single logical format seems to fulfill these re- 
quirements, it is recommended that the following three logical structures be 
available on request: band interleaved by pixel (BIP), band interleaved by 

line (BIL), and band interleaved by scene (band sequential or BSQ). 

Special order products 


The conference recognized the Importance of specifying standard. 


'pipeline 1 products for which NASA and EDC could configure their production 
system for high volume and maximum availability. However, there was a 
general agreement that the Interests of users requiring specialized pro- 
ducts should be protected, in that NASA or EDC should make a variety of 
processing options available on a retrospective basis. These might Include: 

nonstandard quadrangles (e.g., to fit specific study areas) 

nonstandard projections (e.g., to fit numerous state plane 
systems ) 

nonstandard sample spacings (e.g. , to yield specific output 
scales on line printers, film writers, "grid" information 
systems, etc.) 

The workshop members anticipated that the costs for these special products 
might well be more than for the standard products. 


ANNOTATION 


For digital data to be properly and efficiently processed, it must be as- 
sociated with a thorough and accurate annotation record. The annotation must 
provide in a simple format the information required by the myriad users of data 
from Landsat-3, the themrtic mapper on Landsat-D and other future space imaging 
systems. While the needs of these users will vary, the following recommendations 
and subsequent discussion are based upon the premise that the more annotation 
information provided to the user, the more universally usable Landsat and future 
Imaging Systems data will become. ’ 

RECOMMENDATIONS 

Complete annotation should be made available to users of either raw or 
reformatted satellite data. All forms of annotation currently provided should 
continue to be provided. In addition, it is recommended that the following in- 
formation be supplied with all digital image data: 

* Type of projection, if applicable. 

* Sampling interval in meters on the ground. 

* Up-to-date algorithms and coefficients for converting digital radiance 
numbers to actual space upwelling radiances. 

* Several ground cqntrol points identified in image and standard geo- 
reference coordinates. 

* More accurate geodetic grid ticks and nadir/format center coordinates 
(preferably designated in latitude and longitude). 

* More accurate spacecraft positional (altitude and attitude) information 

* More spatially specific cloud coverage estimates. 

DISCUSSION 

Utilization of existing digital image annotation varies greatly within the 
user community. Some users ignore it completely, although none were willing to 
suggest deletion of any existing annotation. On the other hand, some users make 
extensive operational use of annotation data; for example, the use of spacecraft 
heading to derive a systematic rotation correction. Recent deletion of this para- 
meter f^om the annotation has caused some concern. Most users found the an- 
notation data currently supplied to be deficient in some respect and recommended 
changes and additions. There was general agreement at the conference that NASA 
should supply all available annotation of CCT's. 

If the image has been transformed to a specific map projection, the annota- 
tion should include a description of the projection, the specific parameters em- 
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ployed, and references to the relevant forward and inverse transformation al- 
gorithms. 

Spatial sampling interval on the ground should be specified for each image 
(corrected for systematic distortions such as line perspective, scan nonlinearity, 
orbital parameters and earth curvature). This would be more accurate than the 
current practice of simply dividing the dimensions of the image matrix (e.g., 

2340 lines by 3240 samples) into some standard frame size (e.g., 185 km on a 
side). 

Conversion of digital radiance numbers (RNs) to actual space upwelling 

O 1 

radiances (Hm sr ) is usually accomplished by linearly mapping the quantization 
level onto the radiometric response (dynamic range) for each band. Since both 
the dynamic range (i.e.» threshold and saturation values) and the order of the 
calibration equation may be subject to revision, it is essential that sufficnent 
data be included with each CCT to permit rapid voltage-radiance conversions with 
the maximum accuracy achievable at the time the tape is generated. 

Ground control points are necessary both for users of uncorrected tapes who 
wish to do their own rectification, and for users of corrected data who may wish 
to check the accuracy of the data for themselves. It is important that each 
Image contain some reasonable minimum number of points (e.g., 10), which are iden- 
tified In the annotation block in the image and geodetic coordinates, so that 
acceptable overlaying of image data to other image data, maps and information 
systems may be achieved. The RMS error of these control points in both image and 
geodetic space should be specified, so that some inferences may be made as to 
the accuracy of transformations based on these points. The geodetic system of 
reference should be latitude and longitude, on a specified ellipsoid, to be ac- 
cessible to the largest number of users. There was general agreement among users 
attending the conference that latitude and longitude is the best coordinate sys- 
tem for location of ground control points. 

Currently supplied corner and edge ticks, and nadir and format center co- 
ordinates, are too coarse to be useful for image rectification and location of 
pixels on the ground. These should be of the same order of accuracy as are those 
produced on the fully corrected image product, both for corrected and uncorrected 
data. If this requirement is met, it follows that these edge and center tick 
marks will also constitute additional control points. 

Current (1978) spacecraft positional data is of insufficient accuracy to 
permit its use in scene correction. It is our understanding that, beginning with 
Landsat-3, improved spacecraft positional accuracy will be achieved. This new 
data should be supplied in more accurate form with both data modes, for users 
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who wish to rectify full scenes on their own, and for those users who wish to 
find the positions of single pixels at a time. Concern was expressed by users 
interested in receiving and manipulating raw data that complete information be 
abailable in the CCT annotation record for image rectification. 

Current cloud coverage estimates as supplied by the EDC are too vague to be 
useful to those users whose area of interest covers only a specific portion of a 
Landsat scene. It was suggested that cloud cover estimates include some comment 
on the spatial distribution of the cloud cover within the scene; e.g.» by Cartesian 
quadrant, by CCT strip file, etc. There was discussion of the need for availabil- 
ity of quick-look, low quality imagery from EDC so that users could effectively 
evaluate imagery before ordering. This has been done successfully in Canada 
using photo-facsimile products 48 hours after satellite overpass. EDC's proposed 
image microfiche Index will probably fill this need when it becomes operational. 
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RECTIFICATION 


As recomended above, future Landsat imagery should be available both as 
raw data with improved header information, and also in a format which has been 
fully rectified to a standard projection, such that the tape may be accessed 
directly by a geodetic coordinate system. The following recommendations are 
made as to the specific nature of the fully corrected product. 

RECOMMENDATIONS 

The rectified imagery should meet the following specifications: 

* The map projection employed should be conformal and the pixels should 
be aligned east-west, amenable to Cartesian referencing. Universal 
Transverse Mercator was the projection preferred by users attending 
the conference. 

* Rectified imagery should be indexed to existing USGS 7-1/2 minute 
quadrangle topographic maps, and sample locations should be fixed. 

* Resampling should be done only once and should either preserve the 
nominal spatial resolution of the sensor, or, if confirmed by further 
study, transform to one of a series of standard resolutions; e.g., bi- 
nary (16, 32, 64, 128... meters) or "10/25“ preferred by some users 
(10, 25, 50, 100, 250... meters). 

* Projections other than the recommended UTM standard should be available 
upon request, but the transformations should be done from the raw data 
to avoid resampling data more than once. 

In terms of assessing the effect of image rectification on image data 
accuracy the following is recommended: 

* Further investigations should be undertaken to determine the effects 

of various resampling algorithms, including double resampling, on radio- 
metric and spatial fidelity and classifier performance. 

* The rectification algorithms used should be provided with the data to 
the user community and algorithms for forward and inverse transforms 
between a variety of standard projections should be available for dis- 
tribution to users. 

Rectification of imagery to a conformal projection assures that small 
shapes will be preserved and the general 'sense' of the image will be undis- 
turbed. There was considerable discussion of exactly which projection should 
be used in light of the extremely varied needs of user agencies across the 
country. The consensus of this was: 

1. If possible, given cost and processing constraints, users should be 
able to specify a variety of projections when ordering data. 

2. If the standard product is to have only one standard projection, 
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then UTM is the most desirable because of .its generally east-west 
orientation and widespread use (except at high latitudes where the 
Polar stereographic projection is more appropriate). 

3. Most users find the Space Oblique Mercator (SOM) unacceptable, because 
it yields a grid rotated too radically for use with other geobase data 
systems, and which differs from spacecraft to spacecraft. 

Some users prefer UTM outright because their other data bases and many US6S 
topographic sheets are already in this projection. Other users prefer the 
Lambert Conformal Conic (LCC) projection because it precisely matches the State 
Plane Coordinate System in many states. The difference between these projections 
at commonly used scales is mainly one of orientation; it is therefore not un- 
reasonable to request both products. However at the scale of a 7-1/2 minute 
quadrangle, the differences are almost impossible to detect. 

The problem with either UTM or LCC is in areas which occur across the 
boundaries of zones. Zone overlap is fairly easy to correct for; at the scale 
of a 7-1/2 minute quadrangle this would not be likely to occur. A study area 
spanning several zones would be a more difficult problem. 

Space Oblique Mercator is unacceptable because no other geodetic data are 
oriented to SOM. Image data has been resampled by NASA to SOM and then must 
be resampled again by the user to be useful. One of the issues universally 
agreed upon at the conference was that any resampling should be done only once. 

Current indexing techniques for Landsat data leave much to be deisred. 
Indexing by frame does not correspond to any other geobase data format and 
thereby requires extensive modification to overlay Landsat data to other data 
bases. The main problems with indexing by frame are: 

1. The size of the frame (roughly 185 km square or 7 1/2 million 
discrete spatial locations, or with 4 bands 30 million pixels) is 
much too large for most small users with limited computer resources. 

2. The artifical frame boundaries are a memorialized holdover from 
the days of photo interpretation, and as such inhibit the use of 
Landsat data as a digital geobase data system. Providing mosaik- 

ing of multiple scenes and sub-scene segments to users is appropriate. 

While some users may be interested in very large multiple frame segments 
of Landsat data, most smaller agency users desire smaller subimage segments. 
Satellite data is received as a continuous strip, hence boundaries and segment 
size could be user defined. This would present storage and processing problems 
for EDC. A standard indexing system should be employed which Is manageable for 
EDC and correlates well with other digital data bases. There was good agreement 
at the conference that image data indexed by 7 1/2 minutes US6S quadrangles would 
satisfy these constraints: 

1. The data segments would be smaller and more manageable for users 
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with limited computer resources. 

2. Satellite data would be well referenced to the most common geo- 
detic data base, e.g. topographic data or digital terrain tapes. 

3. Problems of data discontinuity at path boundaries would be avoided 
because a 7 1/2 minute quad Is small enough to be within the overlap 
between paths. 

It was recognized that the system would be valid only in areas where 7 1/2 
minute quads have been surveyed, but that includes most of the United States and 
Canada. A system adapted for other parts of the world requires separate analysis. 

Image ‘rotation' or resampling to scan lines along grid eastings is a com- 
putational problem requiring either very large quantities of main memory or a 
sophisticated (and usually slow) virtual memory system, both of which often ex- 
ceed the capabilities of many users. Alignment of the scans so that the pixels 
run east-west in the selected projection, would be most cost-effective if done by 
NASA or EDC at the same time as the rectification. 

The EDI PS system should be able to provide data in a variety of pro* 

jections on a special order basis for users who have needs differing from the 
standard product. In addition, the algorithms for transformation of one pro- 
jection into another should be made available to users. This will facilitate the 
use of Landsat data with other geobase data systems. 

It was the general concensus of the conference that, althougn image re- 
sampling does not seem substantially to affect radiometric and classification 
accuracy, further investigation is needed into the effects of various resampling 
algorithms on radiometric fidelity, spatial resolution, and the performance of a 
variety of classifiers. In lieu of more definitive information on the effects of 
resampling, it is recommended that users have the option of requesting either 
nearest neighbor or cubic convolution resampling algorithms be used in pro- 
ducing their rectified tapes. (For further information on the effect of resam- 
pling, see the selected bibliography). 

To provide maximum flexibility for users who 'personalize' their digital 
data, the algorithms and, more importantly, the ground control points and space- 
craft positional data used in producing the rectified tape, should be made avail- 
able for user inspection. The ground control and positional data should be in- 
cluded as part of the standard annotation, and the algorithms employed in the 
actual rectification should be subject to user modification and reversing, at 
their own installations. 
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RESOLUTION REQUIREMENTS 
SPATIAL RESOLUTION 


Regarding spatial resolution* the conference attempted to strike a 
balance between scenarios which would provide the highest resolution achievable; 
and scenarios which would provide the maximum continuity with existing digital 
data base structures. The following recommendations are those which received 
general support. Other recommendations and some of the debate surrounding them 
are presented in the discussion. 

RECOMMENDATIONS 

* "Raw" digital data should preserve the nominal spatial resolution of 
the imaging sensor. 

* The "corrected" digital data should be resampled at an Interval selected 
from the binary series (i.e., 2 n meters), or the "10/25" series (10, 25, , 
50, 100, 250... meters). For Landsat-D, a suggested resampling interval 
would be 25 or 32 meters. 

* Data compression techniques which irreversibly degrade spatial resolu- 
tion should not be employed, 

* Precise evaluations of the effects on spatial resolution of any resam- 
pling algorithms employed, should be made available, either by broader 
dissemination of existing studies or the initiation of new ones, 

DISCUSSION 

Appropriate spatial resolution for future sensor systems 

It was the consensus of the conference that at least one of the available 
digital data products preserve the nominal spatial resolution of the sensor as 
well as the original radiometry without resampling. That is, raw CCT's should 
continue to be supplied as now. The rationale for this conclusion was that it 
would be pointless to implement a system with, say, 30 meter resolution, if the 
standard data product were constrained to 50 meter resolution. It is 
imperative that users who require high spatial resolution have access to the 
full capabilities of every sensor system. 

Beyond agreement that maximum spatial resolution should be available on 
raw CCT's, there was considerable discussion as to the optimum spatial resolu- 
tion for the standard , high-volume reformatted and resampled digital product as 
described earlier. Significant spatial resolution schemes which were proposed 
are listed below: 

- Spatial resolution as a binary series : There was strong feeling that 

an optimum hierarchy of spatial resolutions could be obtained by 
scaling each resampled standard product pixel size to the nearest 
2 n meters (e.g., 16, 32, 64, ...). Resampling following this scheme 
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would permit highly accurate upward aggregation, assuming the pixels 
were square (i.e., same sample Interval In X and Y). No examples were 
given of existing data bases which follow this scaling hierarchy; it 
Is anticipated that the pixel locations produced by resampling would 
be invariant as to geographic location to ensure direct overlay of 
multitemporal imagery. The principal advantage of the binary series is 
that all pixels nest In the next coarsest resolution, thus making for 
greater simplicity in pixel aggregation and manipulation. 

* Spatial resolution In. a -"10/25-" series : A number of users at the con- 
ference, and a wide group of Canadian users after the conference, ex- 
pressed interest in a sequence of resolutions, here specified as the 
"10/25" series, as follows; 10, 25, 50, 100, 250, 500, 1000... meters. 

Only the step from 10 to 25 and 100 to 250 meters fails to nest and 
users felt this was worth the simplicity of the resolution steps suggested. 
Fixed pixel locations were also desired. 

" Spatial* resolution as multiples of current resolutions : It was noted 

that some existing data bases have adopted the current Landsat sample 
interval as the determinant of their fundamental grid cell. For the 
sake of continuity, many users felt that future digital data should 
be supplied in the current 57 x 79 meter cells or some Integer multiple/ 
fraction thereof. Other users argued that this would be too restrictive 
of future improvements in resolution, and that the current cell size is 
resampled to overlay existing data bases, or to produce constant-scale 
output on specific display devices (film writer, line printer, etc.), 
at least as frequently as it is used 'as is 1 . 

- Nominal sensor resolution as the standard resolution ; Many users felt 
that the nominal sensor resolution was actually the most appropriate 
resolution to offer as a standard product. They reasoned that the 
uses to which Landsat data are put are sufficiently diverse that to 
endorse a particular 'synthetic' resolution might tend to discriminate 
in favor of some users and against others. These users also felt that, 
rather than providing a range of spatial resolutions, NASA should devote 
its energies to other processing options. 

Something of a consensus emerged that the most significant group currently 
interested in a fixed resolution were those users relying on the spatial reso- 
lution of current Landsat data. It was proposed that future Landsats, there- 
fore, either carry an MSS similar to the present ones, in addition to any higher- 
resolution devices, or else that NASA make the data from future high resolution 
sensors available at current Landsat resolutions as one optional product. This 
met with some acceptance. Finally, many argued that data aggregation was a re- 
latively simple enough task to be left to the user, and should not be ascribed 
such importance as to impede future progress in sensor technology. 

The users who pushed most vigorously for either the binary series or the 
"10/25" series of pixel resolutions, and fixed pixel locations, aligned on east- 
ings, argued strongly that such products would greatly expand the usefulness of 
all NASA digital image products, and lead to their much more widespread use. 

There is no doubt that by so doing, multitemporal comparisons, and nesting of 
NASA or NOAA products will be transformed from being very burdensome to simple oper- 
ations. The merits of such standardization warrant careful scrutiny by NASA and EDC. 
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RADIOMETRIC RESOLUTION 


The principal radiometric problems addressed by the conference were cal- 
ibration, striping, and quantization, Users v/ere concerned that radiometric 
corrections be transparent to the unsophisticated user, yet amendable to modifi- 
cation by the sophisticated user. 


RECOMMENDATIONS 

* NASA should continue to improve their calibration procedures, 
and make all algorithms and constants available to the users 
through EDC. 

* Data in which striping is present must be available to the users 
in uncorrected form. 

* Pathwise calibration on an entire strip should be employed, to 
minimize scene-to-scene and along-track within-scene variations. 

* Future video data should be represented as 3-bit, unsigned binary 
integers in the range 0-255. 

* Further investigation (and/or broader dissemination of existing 
data) is needed regarding the effects of resampling on radiometric 
fidelity, and on the feasibility of applying atmospheric corrections 
at the system level. 


DISCUSSION 

Calibration improvements 

Current radiometric calibration procedures were generally felt to be in- 
adequate, in that most imagery still exhibits varying degrees of interdetector 
striping. It was pointed out that this is a hardware as well as software pro- 
blem, that the tolerances of the sensors and film recorders need to be quite 
precise before striping subsides to a level that is undetectable to the human 
eye. If a software calibration procedure other than the current filtered- 
regression is devised, it should be. at least as thoroughly documented as at 
present, with raw calibration levels, gains, and offsets continuing to be sup- 
plied on at least the uncorrected CCT's. 

User striping corrections 

If system calibration fails to eliminate striping, most users would seem 
to prefer to remove it themselves by having access to uncorrected data. It was 
noted that users to whom striping is most offensive have already devised means 
of statistically normalizing interdetector gain variations, and it was not un- 
reasonable to expect them to continue to use their specific algorithms. In 
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addition, some users desired access to uncorrected striped data in order to 
obtain, radiometric levels that have not been subject to statistical 'blurring*. 

It was especially felt that users to whom actual radiometric values, In terms 
of space upwelllng radiance, are Important, should be able to process new CCT's 
themselves. Other user? would very likely want destriplng applied by EDC to 
the corrected, standard product Imagery. 

Spatial scale of calibration algorithm 

The current calibration algorithm is reinitialized for each HOOT entering 
the NASA IPF system. By serially filtering the detector gain and offset, there 
often results noticeable along-track gain variations within a scene, which may 
become substantial over the portion of the path which the HDDT covers. Many 
users would prefer a type of calibration whereby the filtered gain and offset 
resulting at the end of each HDDT is applied uniformly throughout the HDDT. 

This would seem to eliminate some intra- and interscene gain variations for which 
users must presently compensate themselves. 

Quantization and Representation 

Some users expressed dissatisfaction with the limited dynamic range Imposed 
on present Landsat data by a six-bit quantization, especially those users who re- 
quire imagery over snow -^covered areas, where landsat often saturates. It was felt 
that this problem could be alleviated by adopting an eight-bit quantization for 
future Landsat data. It was also suggested that all video data, regardless of its 
quantization level at the sensor, be expanded to fill the range 0-255. This would 
be accomplished by zero-fill of the least significant bit (l.e., multiplication 
by 2^“ n ^, where n is the quantization level at the sensor); or, where appropriate, 
by nonlinear decompression. A standard quantization would simplify comparison of 
radiometric levels from different sensors (e.g., saturation would always be level 
255), and would reduce the number of rescaling procedures necessary when displaying 
the data on devices with other than 256 gray levels (e.g., 1 ineprinters , storage 
tubes, etc. ) 

Further investigations 

Participants required more information on the radiometric effects of re- 
sampling. Especially concerned were those users who convert digital to analog 
radiances. It was felt that further elucidation is needed of the effects of 
standard resampling algorithms relative to these subsequent data conversions. It 
was acknowledged that most users are not concerned with the analog behavior of the 
data, and that in regard to resampling in general, sufficient information may al- 
ready exist and may simply require wider distribution. 


A number of users expressed an Interest in Including atmospheric corrections 
as a standard processing option. Users already familiar with employing these cor- 
rections In their own work cautioned that a universally applicable atmospheric 
correction might not be feasible* and It was recommended that further study be done 
on a possible system-wide atmospheric correction. 
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SPECTRAL RESOLUTION 


The conference was concerned with spectral resolution to the extent that 
future sensor systems provide continuity with existing spectral classification 
schemes . 


RECOMMENDATIONS 

* Future operational sensor systems should (to the degree proven necessary), 
provide spectral continuity with multi spectral bands already operational 
in the user community. 


DISCUSSION 

Users were less concerned with evaluating the utility of the current 
Landsat MSS bands than they were that future operational satellites might 
render current operational applications of MSS data obsolete. It was especially 
feared that the great amount of effort whin, some users have invested in develop- 
ing classifiers based on the current MSS ' division of the spectrum would be for 
naught if data in these bands does not continue to be available. Users expressed 
concern that the data from Landsat-D would require expensive "re-education" of 
currently operational multi spectral classifiers; many were not aware of NASA's 
tentative plans to incorporate an MSS aboard Landsat-D in addition to the TM 
(Lynch, 1976a). It appears that such an interim measure would be effective 
until the parameters for an operational series of Landsat follow-on craft can 
be finalized (see Colvocoresses, 1977). 

This should not be construed as intending to limit further development of 
multi spectral scanning systems; rather, that future spectral coverage assign- 
ments should consider the operational use being made of existing spectral bands. 

Beyond the concern for data continuity, the conference did not address such 
questions as optimum spectral partitionings, etc. 
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TEMPORAL RESOLUTION 


The conference discussed temporal resolution both in terms of the overpass 
interval of the satellite, and in the broader contexts of timeliness of data 
delivery, and temporal breadth of the archival data base. 

RECOMMENDATIONS 

* NASA should provide for a more frequent overpass interval than the 
current 18 days, either by utlizing different orbital parameters or 
by the recent practice of staggering the orbits of 2 or more identical 
satellites. 

* Multiple coverage of path overlap areas should be retained. 

* Turnaround time on orders of digital products should be drastically 
reduced, certainly to no more than 7 days from satellite passage 
and preferably within 48 hours. 

* Data archiving should permit preservation of good imagery of areas 
where cloud cover is a severe problem. 

DISCUSSION 

More frequent overpasses 

Many users found the current overpass interval of 18 days too infrequent for 
a variety of disciplines, either for events which have a higher temporal frequency 
(e.d., meteorology), or for low-frequency events of acute intense magnitudes (e.g., 
hydrology: floods, snowfall). To some users it seemed that the most appropri- 
ate solution to this problem would be several Landsat type satellites whose inter- 
vals were staggered to provide a higher coverage frequency. There were no sug- 
gestions as to how to deal with the correspondingly higher data volume and sub- 
sequent archiving problems, nor any agreement on what might be an optimum over- 
pass interval. This appears to be a problem which needs more input from a greater 
spectrum of the user community. 

Coverage in path overlap areas 

Some users expressed concern that a system which permits image accession by 
specification of the relevant US6S quadrangle, or by some other geodetic reference, 
could lead to the arbitrary assignment of a geographical location to a single over- 
pass. For areas which lie in the zone of overlap this might lead to half or more 
of the data for that location being discarded. Users, especially those at high 
latitudes who have come to rely on multiple coverage due to overlap, recommended 
that any accession system allow the specification of one or any combination of 
overlap imagery for a specific location. 


Turnaround time 


Almost all users felt that the temporal resolution of the data in terms of 
overpass interval was irrelevant when compared to the time spent waiting for an 
Imagery order to be filled. Waits of 3 months or more for a CCT order were not 
uncommon. It was generally felt that standard products should be available to 
the degree that time spent in transit would be the single greatest factor between 
an order's being received by EDC and Its delivery to the user. On retrospective 
order* it was conceded that a longer wait was justifiable; however, delays on the 
order of months were judged extreme by anybody's standards. The comment was made 
that current users would "sacrifice anything for timeliness"; however, others 
countered, from experience in providing their own rapid turnaround, that there 
was a point at which users would prefer to wait if the overall quality of the 
image would be impared. A specific recommendation which arose in connection with 
this problem, and received moderate support, is that EDC should establish a pri- 
ority system regarding availability of imagery immediately following an overpass; 
this recommendation received support from users who felt that the current speed 
with which an order is filled has no discernible relation to the urgency of the 
user's need. 

Archiving 

There were some users who operated in areas where factors such as frequent 
cloud cover, or dependence upon the satellite's limited on-board storage capability, 
prohibit the obtaining of acceptable imagery with any degree of regularity. These 
users were concerned that, should data volume considerations result in a shorter 
overall archival period, imagery which was old but still useful by virtue of its 
relative rarity would be discarded. It was urged that such problem areas be i- 
dentified, and that the archival period of acceptable imagery of these areas be 
suitably adjusted. 


- 50 - 


The structure of the data on a CCT Is of particular importance 
to all users. The tape format is especially critical for I/O considerations 
and related computational costs to users processing Landsat data. In 
addition, various techniques used by NASA for the data stream and for more 
efficient data storage have an Impact on users In the areas of data reliability 
and software and hardware development. In light of this, the following 
recommendations were made. 

RECOMMENDATIONS 

* Three logical structures, BIP (band interleaved by pixel), BIL 
(band interleaved by line) and BSQ (band sequential) should be 
available. The current structure, BIP2 (band interleaved by pixel 
pairs) Is considered the least useful format. 

* The logical record structure of the image file should include 
substantial annotation for each record. 

* Physical blocking of data on a tape should provide for I/O 
buffer limitations of minicomputers. 

* Any data compression techniques employed should be reversible 
to the level of sensor noise. 

* An appropriate Federal agency, probably EDC, should distribute 
Fortran I/O subroutines for performing common manipulations of 
Imagery in whatever formats are ultimately supplied. 

* 1600 8PI CCT's should continue to be the principal mode of 

data exchange until higher-density storage devices become more wide- 
spread. 


DISCUSSION 

Users at the conference were divided as to the optimum interleaving format 
for the image matrix. There was, however, good agreement that the currently 
used system of band interleaved by pixel pairs (BIP2) was the least con- 
venient possible format. Listed below are the four possible formats for a CCT: 


S 1 S 2 S 1 S 2 S 1 s 2 S 1 s 2 S 3 S 3 S 4 S 3 S 4 S 3 S 4 S 5 S 6 S 5 
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BSQ (e.g. t VICAR format) 

si- ...... s n 
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Band interleaved by pixel (BIP) is the ideal format for classifications. 
Minimal bit manipulation is required and reformatting is not usually required. 
Band sequential (BSQ) is ideal for single band analysis and most image generation 
devices. It is fairly complex to reformat from BIP to BSQ, but the band inter- 
leaved by line (BIL) format is the intermediate format. From BIL it is fairly 
easy to reformat to either BIP or BSQ and is therefore an adequate format for 
either classification, single band analysis, or image construction. The BIL 
format allows unformatted reads and requires only a modest amount of core for 
reformatting. 

Of the four formats illustrated BIP2 is the least flexible. Regardless 
of the application, data formatted in BIP2 must be reformatted. BIL is the most 
flexible and if one format is to be selected as the standard, users felt that it 
should be BIL. The consensus of the conference was, however, that it should be 
possible to get data in each of the three previously mentioned formats (BIP, 

BIL and BSQ). 

While a truly "universal" tape format was acknowledged to be an unreal- 
istic expectation, it was suggested that whatever format Is ultimately adopted 
would be considerably more flexible if it incorporated annotation for each data 
record. Using this scheme, video data could be interspersed with character data 
(e.g.. Image and descriptive text); raster and polygon formats could be included 
in the same file; and image scenes could be of arbitrary dimensions. This "re- 
cord-comprehensive" annotation would also be amenable to the construction of 
scene and tape directories so that users could directly access only those por- 
tions of a tape that were of immediate Interest. 

Many users expressed concern that digital data is often placed on tape in 
the largest possible physical blocks. Whether the motive is to reduce I/O's or 
to squeeze more files onto a single tape, this practice often results in a tape 
which is unreadable to users of "mini" computers, where large I/O buffers con- 
sume an inordinate amount of main memory. It was recommended that users at 
least have the option of specifying block sizes as small as 4K (4096 (8-bit) 
bytes) without cost penalty. Smaller block sizes should also be available, 
even though this would require a single scan line to span several blocks. As 
the user community increases in size the reliance on minf-computer systems for 
image data analysis will also Increase. On the other hand, memory for mini- 
computers is likely to become increasingly less expensive. 
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Concern was expressed that Increasing data volume would result in the 
application of data-compression techniques that were not fully reversible. It 
w^s argued that deployment of sensors whose data rates required irreversible 
compression would tend to negate any resolution gains such sensor systems might 
exhibit. It was agreed that any data compression techniques employed by NASA 
bfe subject to user scrutiny, and their effect on the nominal quality of the data 
be fully documented. 

Along with the agreement that a single optimum tape format for video data 
phobably did not exist, there was agreement that one of the single highest costs 
to digital image users is that of reformatting the data until it Is digestible 
t ! o their particular computer system, regardless of whether the data has been 
rectified, how it is framed, etc. This Is basically an I/0-1ntensive program- 
ming problem, which to be solved efficiently often requires a level of program- 
ming skill which the typical user does not possess. Recognizing Fortran as the 
standard cal' lg language, it was recommended that NASA, EDC, or some other ap- 
propriate Federal agency, make available a basic set of I/O subroutines which 
have been tailored to whatever image formats are ultimately adopted, as well as 
to formats (CCT's, Digital Terrain Tapes) which are already in widespread use. 

While users expressed Interest in high density modes of data transfer 
(e.g., 6250 bpi digital tape, optical discs, etc.) it was generally agreed that 
the widespread application of this technology at the end-user level is still a 
few years distant. Some concern was expressed for the significance of bit errors 
associated with current HDDT's {high density digital tapes); it was recommended 
that these be evaluated statistically to the user community's satisfaction be- 
fore high-density devices become a significant mode of exchange. Most users 
prefer the current 800 and 1600 bpi tapes for end-user distribution, while ac- 
knowledging that much higher data densities will be required at the archival 
and interagency bulk-transfer levels. 


DETERMINATION OF USER DEMAND. PRICING SCHEDULES 
AND ABSORPTION OF PREPROCESSING COSTS 


Assuming that both data and software are to be provided as suggested 
in earlier sections of this report, it is necessary to determine the level 
of user demand, the pricing structure for these products and the degree 
to which NASA and/or other federal agencies will absorb preprocessing 
costs. This section addresses these complex questions. The recommendations 
are followed by a discussion of the points which have led to their adoption. 

RECOMMENDATIONS 

* Better estimates of the needs of the users of satellite data 
are required. The needs of state and local governments for 
several levels of processing of the data, and for various 
software implementations, seem poorly understood. NASA, or 
some other federal agency, should undertake to provide more 
detailed estimates of these needs. 

* Both the demand and the pricing schedules should be determined 
by an analytical technique such as the demand revealing pro- 
cesses (discussed below). These determinations should be made 
for various product combinations and should be performed on a 
regularly scheduled basis. 

* A portion of the preprocessing costs should be absorbed by the 
federal government, while the variable costs should be re- 
covered from the aggregation of users. This federal investment 
will maximize the benefits obtained from the use of satellite 
data by overcoming the threshold necessary to initiate the 
diffusion of a technological innovation. In economic terms 
the federal subsidy should be viewed as an investment aimed 

at maximizing the present value of the stream of net benefits 
obtained from the use of satellite data. 

* NASA, the EROS Data Center of the U.S. Department of the 
Interior, and such other federal agencies as should properly 
be Involved, examine the division of responsibilities and 
costs between them, in order to implement the preceding 
recommendations . 

DISCUSSION 

A variety of beliefs and opinions concerning user demands, pricing 
structures and preprocessing costs were expressed at the Workshop. However, 
a firm consensus was apparent among the participants. In the following 
discussion, the main categories of concern are isolated and the diversity 


of opinions Is represented. Following this, the consensus Is summarized, 
and finally a solution to the problem of determining user demands and 
pricing structures Is presented. The solution is an attempt to incorporate 
the consensus Into a sound economic structure. An extended discussion of the 
solution may be found In Appendix C. 

Ranges of Belief Concerning User Demands. Pricing Structures, and Pre- 
processing Costs . 

On t the Identity of the Users of Concern : 

1. The users of concern are agencies of state and local governments. 

2. The users of concern are those of 1. above and private sector 
users. 

3. The users of concern may be viewed as a pyramid, with decreasing 
technical sophistication towards the base. 

4. Many potential users are not presently actual users of satellite 
data. 


On the Coate to Present Users of Satellite Data : 

1. Many users, especially agencies of state and local governments, 
face a number of significant costs in using satellite data. 
Among these are the direct costs of obtaining the data, the 
costs of personnel training, the costs of software development, 
the opportunity costs resulting from duplication of effort, 
the costs of data processing, the costs of delay In obtaining 
the required Information from the data, and the costs of un- 
certainty, particularly concerning the continuity of the data 
products. 

2. The direct costs of obtaining the data are (at present) 
reasonable. 

3. All costs apart from the direct costs of 2. are very high, and 
inhibit a significant number of potential users from employing 
satellite data. 

4. A small number of users are not deterred from using satellite 
data by the costs of 3. 

On the Benefite From Use of Satellite Data: 

1. There are potential net benefits, not currently accruing, to 
be had from the use of satellite data by users of concern. 

2. Multiplier effects for the economy exist with respect to the 
adoption of satellite data processing by the users of concern. 
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3. Net system benefits are not maximized by the current provision 
of data. 

On the Demands of Users: 

1. The demands of most classes of users are Insufficiently well 
known and should be investigated. 

2. Knowledge concerning user demands should be continually 
monitored. 

3. A full cost/benefit analysis of user demands would be extremely 
difficult and costly to Implement. Alternative methods of 
assessment of demand should be used. 

4. There should be sensitivity to local variation in user demands. 


On Federal Government Alternatives Concerning Demand and Supply: 

1. NASA should .not be involved in the question of both determining 
and satisfying user demands. 

2. NASA should be Involved in the question of both determining 
and satisfying user demands. 

3. Some agency of the federal government should be involved in the 
question of both determining and satisfying user demands. 

4. Regional centers should be set up both to determine and to 
satisfy user demands. 

5. Centralized processing and distribution of data and software 
would maximize economies of scale. 

6. Regional centers would be more responsive to the local needs 
of users. 

7. Both training centers and information dissemination offices 
should be set up by the federal government. 

On the Federal Government Alternatives Concerning Pricing Structures and 
Cost Absorption: 

1. The federal government should act to maximize system net 
benefits. 

2. The agency concerned with supplying data and software should 
use full cost recovery. 

3. The agency concerned with supplying data and software should 
recover the front-end costs of setting up the processing and 
distribution system. 

4. Direct grants should be made to users to encourage their use 
of satellite data. 
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5. In determining the pricing structure of products, the agency 
concerned with distribution should take account of the public 
goods characteristics of data and software. 

6. The price structure of products should be uniform for all users. 

7. The price structure of products should not necessarily be 
uniform for all users. 

8. The federal government subsidy should be viewed as an invest- 
ment to optimize the present value of the stream of net benefits 
from satellite data use. 


The Consensus of Beliefs Concerning User Demands , Pricing Structures and 
Preprocessing Costs: 

There was a high degree of agreement among participants concerning 
the user demands, pricing structure and preprocessing costs. The consensus 
of beliefs is now summarized. 

1. A significant set of users are the agencies of state and local 
governments . 

2. These users of present satellite data products Incur severe costs 
In processing the products. 

3. System net benefits are being lost due to lack of adoption of 
satellite data products. 

4. The demands of users should be investigated and monitored. 

5. Centralized processing and supply of data products and software 
should be Implemented in order to take full advantage of 
economies of scale. 

6. At least part of the front-end costs of setting up a system of 
data and software supply should be borne by the federal govern- 
ment agency. 

7. Variable costs of data and software supply should be borne by 
the users'. 

8. Some users should be subsidized in their use of data and software 
products relative to other users. 

9. The overall aim of the federal government should be to provide 
data and software with a pricing structure that would maximize 
system net benefits over a period of years. 


A Proposed Solution to the Problem of Determining User Demands and Pricing 
Structures 

It is proposed that Demand Revealing Processes be applied to provide 
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a solution to the problem of determining user demands and price structures 
for various combinations of data and software supplied by a federal govern- 
ment agency to other Federal government and state and local government users. 
The theory and application of the Processes is presented in detail in 
Appendix C. 


As applied to the present problem, Demand Revealing Processes are 
a simple and elegant device whereby federal, state, and local government 
agencies are induced to reveal their "true" preferences, in terms of 
willingness to pay, for various combinations of data and software products. 
Given a required degree of cost recovery for the central agency providing 
the products, the Demand Revealing Processes indicate the mix of products 
to be supplied. The pricing structures take account of differential will- 
ingness to pay among users, while changing preferences may be taken into 
account by continual application of the Processes. The combination of 
Demand Revealing Processes and the supply of data and software may be 
viewed as an optimal investment strategy regarding the application of 
satellite imagery by the agencies of state and local governments. 

SUMMARY 

NASA, or some central government agency, should apply Demand 
Revealing Processes in order to determine the true demands of state and 
local governments for satellite data and software products in a continually 
updated manner. Such processes would allow varying degrees of cost 
recovery in relation to supplying various mixes of data and software, 
while users would face charges directly related to their "true" willingness 
to pay, despite the public goods nature of such satellite products. 

While a portion of the preprocessing costs should be absorbed by 
the federal government agency, the application of the Demand Revealing 
Processes and the provision of various combinations of data and software 
should be viewed as an optimal investment decision, whereby the present- 
value of the net benefits of satellite data use is maximized. 


SOFTWARE PORTABILITY 


In the preceding sections recommendations were made on development of 
standard format(s) for digital image products, and on absorption of most of these 
front end costs by federal agencies as part of a "seeding" program to facilitate 
their use. Recommendations were also made on the development of software'— to 
fully use these data— as part of geographical information systems. The software 
developed for these purposes should be as fully portable as possible, for a va- 
riety of computers. The present section addresses existing problems of lack of 
portability, and the resulting immense burden on users, because of the idiosyn- 
cratic development of software without agreed-upon standards. Some preliminary 
recommendations for standard procedures are given and discussed. The establish- 
ment of Image Processing and Analysis Software Standard Procedures is recommended 
for NASA and EDC consideration. 

RECOMMENDATIONS 

* Where possible, programs for general distribution should be written 
in FORTRAN. Departures from the ANSI X3.9 1966 Standard should be 
isolated in subroutines and clearly documented. Especially, pro- 
grams should isolate all input/output and bit manipulation portions. 
Implementation of this recommendation requires that programmers 
read and become familiar with the standard. A new "Draft Proposed 
ANS FORTRAN" is in circulation, but it will probably be several 
years before compilers meeting it will be widely available. 

* It is recommended that NASA and EDC develop Image Processing and 
Analysis Software Standards to ensure that future software develop- 
ment is in as internal ly-consistent a form as possible, and to ensure 
that portability is enhanced. 

DISCUSSION 

Like many other present-day scientific endeavors, use of digital satellite 
data requires a large exoenditure of time on the development, conversion, de- 
bugging, maintenance, and operation of computer programs designed to manipulate 
these data. Moreover, many of these programs are available for transport to 
other users, often at nominal cost. The reason for this is that many of the 
programs are in the public domain, having been developed by state or federal 
agencies, or by University researchers funded by state or federal grants. Those 
in the private sector are also concerned with program portability, and this con- 
cern will increase in the future, along with the growth in numbers of commercial 
purveyors of image processing software. The latter are expected to engage in 
practices similar to present software houses (which sell programs for mathemati- 
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cal and statistical analysis) and also sell their image processing and analysis 
software. 

At present the costs associated with software conversion are enormous, and 
they are frustrating because they are commonly under-budgeted. In the Geography 
Department at the University of California, Santa Barbara, it Is currently esti- 
mated that 80 percent of staff programming effort is expended in converting and 
debugging programs which at one time were running on other systems. It Is clearly 
to everyone's advantage if these conversion costs can be decreased, and this 
section discusses ways in which software portability might be enhanced. (Brown, 
1976; Aird et al., 1977). 

The FORTRAN Language 

For a variety of reasons, this document considers the portability problem in 
the context of using FORTRAN as the major computing language. This is perhaps 
unfortunate, as FORTRAN is by today's standards an inelegant and cumbersome lan- 
guage with some severe disadvantages, which are examined in the following pages. 
However, a preponderance of the existing code used to manipulate digital satel- 
lite data is in FORTRAN, and this is likely to be true of most such software 
developed in the next decade. Moreover, if any language is to replace FORTRAN, 
it must certainly be able to incorporate FORTRAN subprograms with a minimum of 
conversion effort. 

FORTRAN does have many advantages. It is widely used and widely taught and 
many subroutines for mathematical and statistical analysis exist. Compilers are 
available for almost every type of computer. Because of the minimal requirements 
for information about a subprogram required by a linkage editor or loader, it is 
usually easy to incorporate machine code subprograms into a FORTRAN program, and 
it is also usually feasible to incorporate FORTRAN subprograms into programs writ- 
ten in other languages. 

Because of its wide use on a variety of machines, some agreement about what 
constitutes FORTRAN is necessary. To this end the American Standards Association 
(ASA) began to define a standard language in the early 1960's, completing the 
work in 1966 after the organization had changed its name to United States of 
America Standards Institute (UASAI). The name has since been changed to American 
National Standards Institute (ANSI), and the standard language is now called 
ANSI X3.0 1966 FORTRAN (ANSI, 1966). Most major compilers accept any program 
conforming to the standard, but many include additional features. At present a 
draft of a new standard FORTRAN is in circulation (ACM, 1976) and will probably 
be approved as the new standard, with modifications. It will probably be several 
years, however, before most compliers conform to the new standard. 
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Problems with FORTRAN and its Portability 

There are many reasons why FORTRAN would perhaps not be the language of 
choice for manipulation and analysis of digital satellite data, were It not for 
the large corpus of pre-existing programs and programmers familiar with the lan- 
guage. Most compilers for the language are poor (Good & Moon, 1973; Hazel, et 
atl., 1973). Rarely do they provide adequate compile-and run-time diagnostics, 
including subsript checking, and allow for creation of optimized, efficient load 
modules. The language is cumbersome, requiring use of many statement labels, 
and structured programming is difficult to accomplish. Moreover, ANSI FORTRAN 
contains a number of limitations. which make satellite data analysis difficult 
or impossible, as well as imposing hardships on other types of programming. 

For these reasons most compilers support lots of extensions to the Standard, 
and many users write their own machine language code to perform certain other 
operations. These "extras" typically encourage machine and machine company de- 
pendence. Students learning FORTRAN at a given university learn the version im- 
plemented on the local machine. As they move elsewhere and become programmers, 
they pick up new features available on other machines. The end result is that 
many experienced and competent programmers do not know what isolates the Stand- 
ard and what does not, and may be quite surprised when they find one of their 
programs will not run under a different version of FORTRAN. Compounding this 
problem is the fact thay very few compilers flag their allowed departures from 
ANSI Standard, and therefore there is no reason or mechanism available to get 
programmers to conform to the standard (Pyster & Outta, 1978). In fact the same 
program may compile successfully on two separate systems , produce no warning or 
diagnostic messages on either , and produce different results . 

Some desirable and widely used capabilities which do not conform to ANSI 
FORTRAN are listed below (some of these are adapted from Larmouth, 1973a, 1973b). 
All of these capabilities are available on some compilers, and many of them exist 
in the draft proposed ANSI FORTRAN (1976). 

1) ANSI FORTRAN has no provision for direct access input/output (avail- 
able in the new draft). 

2) There are no bit manipulation routines or logical masking functions. 

This is a serious obstacle, because satellite data are composed of 
lots of small numbers, which are generally packed in groups into a 
whole computer word. Moreover, the fact that a byte is of different 
lengths in different machines may cause problems. 

3) There is no specific provision for string or character handling. 

(The new draft includes the CHARACTERS ength specification, but 
the collating sequence is not fixed; Sabin, 1976). 

4) There are no data structures. 



5) Recursive subprograms are not permitted. 

6) There is no provision for dynamic space allocation. (Larmouth, 

1973b, describes a way to accomplish this using COMMON.) 

7) Mixed mode expressions are prohibited (e.a. A = A + I is illegal), 

8) An array may contain no more than 3 subscripts {7 in the new draft). 

9) Subscript expressions can be no more complicated than constant * 
variabl ■ t constant (relaxed in new draft, but dimensioned variables 
and function references are still prohibited). 

10) There is no IMPLICIT or NAMELIST declaration (IMPLICIT is in the 
new draft), 

11) Implied loops in DATA statements are not supported; if X is dimensioned 
at 3, DATA X / 3*0.0/ is illegal (relaxed in new draft). 

12) A real variable cannot be assigned to a complex one. 

13) There is no REAL*8, INTEGER*2, or LOGICAL*! declaration. 

14) There are no end-of-file or error traps (there are in the new draft). 

15) Names longer than 6 characters are not supported. 

16) Do loops crossing zero or going in negative direction are illegal 
(relaxed in new draft). 

17) Expressions or array references may not appear in DO or computed- 
GOTO statements. 

18) There is no provision for use of quotes in Hollerith strings (relaxed 
in new draft). 

19) There is no list-directed input/output (but available in new draft). 

In addition, there are desirable features available in other languages which 
would be useful in FORTRAN. PL/I allows the variable declaration statements to 
specify precisely what precision (number of bits) is needed, while ANSI FORTRAN 
allows only REAL or DOUBLE PRECISION for floating point numbers and only INTEGER 
for integers. The number of bits assigned to integers varies from 16 to 60 on 
various machines, and the number assigned to single precision floating point 
numbers varies from 32 to 60. The relative error magnitude caused by a reduction 
from 60 to 32 bits is estimated at 10® (Good and Moon, 1973). Therefore, a nu- 
merical routine which runs perfectly well on a CDC 6400 machine may be swamped 
with error on an IBM 360. Converting the routine to double precision for the IBM 
is very tedious, unless provision is made in the original source code for this 
possibility (Aird 't al., 1977). 

ANSI FORTRAN allows the compiler to use either static or automatic storage 
allocation in subroutines. Most compilers use static allocation, and many of us 
have taken advantage of this fact by storing values in internal variables in 
subroutines between calls. Such subroutines may well introduce errors if run on 
a system where storage allocations are automatic. For example, according to 
ANSI FORTRAN anti to the new draft, a variable whose value is initialized by a 
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DATA statement on the first call to a subroutine, and whose value is modified 
by the subroutine, is undefined on a second call to the subroutine: 

SUBROUTINE XYZ (A,B) 

DATA 1/0/ 

L = L +1 


RETURN 

END 

The value of L at the second call to XYZ is probably 1, but may not be. 

ANSI FORTRAN does not specify the mechanisms by which parameters are passed 
to subroutines, and Waite (1976) has identified five major variations. The 
Standard prohibits redefinition of a dummy argument that is associated with an- 
other dummy argument by the CALL statement. It also prohibits redefinition of 
a dummy argument that becomes associated with a variable in COMMON between the 
calling routine and the sburoutine. However, many compilers will detect neither 
error, and there are a number of variations in the possible answers. Two examples 
follow; the values for N at the execution of the WRITE statement are unpredictable. 


N = 1 

CALL XYZ (N,N) 

Write (6, — ) N 

STOP 

END 

SUBROUTINE XYZ (I,J) 
1 = 1 + 1 
J = 2 * I + 2 
RETURN 
END 


COMMON N 

CALL XYZ (N,N+3) 
Write (6,—) N 
STOP 
END 

SUBROUTINE XYZ (I,J) 
COMMON L 
1 = 1 + 1 
L = L + J 
I = J * L 
RETURN 
END 


Procedures to Enhance Portability 

It is not reasonable to expect all programmers to adhere to ANSI FORTRAN. 
In some cases the non-standard features available are merely frills, and these 
should be exchewed completely. Some features, however, such as direct-access 
input/output, bit manipulation, end-of-file traps, etc., may be very useful. 
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even essential, to efficient programming. 

A workable solution to the resulting portability problems is to follow 
good modular programming practices, and to isolate and clearly document the por- 
tions of the program which depart from the Standard. In particular, programs 
whose input/output and bit manipulation statements lie in separate subprograms 
from the rest of the program are much easier to convert. Even if very meager 
documentation is supplied with programs (as is usually the case) the documenta- 
tion should clearly specify where the program departs from the Standard, what 
routines perform input and output, and the sequence of input variables. Sub- 
routines which perform bit or character manipulation should include good descrip- 
tions of any local routines which are used, so that the recipient can substitute 
his own. Implementation of these practices depends on programmers being familiar 
with the Standard, and there is no substitute for a careful reading of the ap- 
propriate documents (see Chrisman and White, 1978). 

The esthetic deficiencies in the FORTRAN language can be overcome through 
use of a preprocessor (such as ALTRAN, RATFOR, or STAGE2) which converts the 
source code of the program into ANSI FORTRAN, which is compiled for execution, 
or which may be transported to a facility which lacks a preprocessor. 
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APPENDIX B 


SPECIFIC RECOMMENDATIONS REGARDING LANDSAT-D 


The conference did not specifically address Landsat-D related questions, but 
It arrived at recoimendations which are applicable to Landsat-D specifically as 
well as NASA-produced data in general. These general recommendations, placed in 
the context of Landsat-D, are sunmarized below. 


RECOMMENDATIONS 

* Data from the Thematic Mapper (TM) should be available In two formats: 

- uncorrected (radlometrlcally calibrated) and with extensive annotation 

- radiometrlcally corrected and geometrically rectified to a standard 
map projection, scan line orientation, and pixel size 

* Since TM data will require some form of resampling, especially to 
coarser spatial resolutions employed by existing databases, NASA 
should encourage the development and exchange of Information on 
the effects of resampling. 

* Data compression for the TM Is probably unavoidable, but should employ 
methods which are reversible at least to the level of rated sensor 
noise. 

* The logical structure of distributed TM data should be band-lnter- 
leaved-by-Hne (BIL), with optional availability of band-sequential 
(BSQ), and band-interleaved-by pixel (BIP). 

* The physical structure of TM data tapes should be primarily 9- track 
1600 bpl, with optional availability of 6250 bpi tapes as high-density 
tape drives come into wider usage. 

* NASA should be aware of technological developments which, when Imple- 
mented by the user community, might affect the utilization of TM data, 
such as increasing availability of high-density storage devices and 
array processors. 


APPENDIX C 


A SOLUTION TO THE PROBLEM OF DETERMINING USER DEMANDS AND PRICING 

STRUCTURES 

By Robert Deacon, Terence R. Smith & David Slmonett 

NOTE : This paper was prepared after the Conference as part of a contribution to 
the Proceedings of the Second Conference on the Economics of Remote Sensing 
Information Systems , San Jose State University, January 1978. 


ABSTRACT 

While local government agencies are important potential users of satellite 
data, the rate of adoption of this technology by such users has been relatively 
low. This fact is partly due to the high costs associated with processing the 
currently available satellite data. It is proposed that a central agency supply 
a mix of products involving both data in varying stages of processing and soft- 
ware for manipulating the various classes of data. Since both the data and soft- 
ware may be characterized as public goods, it is further proposed that the demands 
of this class of users, as well as the pricing structure of the products, be de- 
termined by use of Demand Revealing Processes (DRPs), which encourage users to 
reveal their true demands. The combination of DRPs and the provision of data 
and software may be viewed as a "seeding" of the adoption process, whereby the 
net present value of the stream of benefits accruing from local government use of 
satellite data are maximized 
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A SOLUTION TO. THE PROBLEM OF DETERMINING USER DEMANDS AND PRICING 

STRUCTURES 

By Robert Deacon, Terence R. Smith fi David Simonett 

The consensus of beliefs essentially Imposes certain conditions on the fed- 
eral government that should be met in relation to supplying satellite related 
products.. Any solution to the problem of determining both user demands and. 
pricing structures should Incorporate these constraints. The fallowing section 
contains a proposed solution to the problem that Incorporates the major constraints 
as weIG as taking. Into account the economic nature of the products that should be 
supplied. The basic ideas are that Qemand Revealing Processes, as described below, 
should/ be used to determine jointly both user demands over various optional out- 
puts and a pricing structure that would be sensitive to individual user demands. 
Such processes not only make provision of data products sensitive to user demands 
and allow any required degree of cost recovery, but may also be incorporated into 
a central processing system that would take full advantage of economies of scale. 
Moreover, the implementation of such a process could be viewed as an investment 
decision designed to optimize the present value of the stream of net benefits from 
the use of satellite data. 

The following discussion of the proposed solution is structured as follows. 
First, the characteristics of the large numbers of potential users lying toward 
the base of the pyramid of users is examined. Second, some problems relating to 
cost benefit studies concerning the use of satellite data are discussed, and in 
particular, the problem that data and software are essentially public goods. Third 
a description of Demand Revealing Processes is given, relating to the determination 
of both user demand and pricing structures. Fourth, these ideas are applied to 
satellite data products and attendant problems as noted. Finally, the application 
of Demand Revealing Processes and the provision of data products and software is 
interpreted in terms of an investment decision. 

Users Near the Base of the Pyramid of Users: Characteristics and Problems. 

Actual and potential users of satellite data may be thought of as being a 
pyramid of users, with the horizontal axis representing numbers of users and. the 
vertical axis increasing sophistication. At the top of the pyramid there are a 
few extremely sophisticated users such as the large oil companies, who have the 
capability, for example, of taking the original computer compatible tapes (CCTs) 
containing Landsat image data, and processing the data entirely with their own 
resources.. Included in this group, of sophisticated users are also major research 
universities, federal laboratories with a research emphasis, and the major nation- 
al facilities maintained by NASA. Towards the base of the pyramid we may con- 
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CCTs in their original format. Rather than try to analyze the composition of 
the whole pyramid, however, we concentrate on a stratum that appears to be of high 
importance, namely the agencies of state and local governments with which we were 
concerned in this Workshop. This group is not of the highest sophistication, yet 
it is. capable of using computer data in formats which do not pose the very large 
front-end investment costs of working with the raw computer compatible tapes. 

This group of users is also sufficiently large to greatly expand the user community 
if they can be brought, at modest investment to themselves, into a position where 
they may use NASA products. The potential importance of satellite data to this 
class of users was clearly spelled out in a recent study by Eastwood et al. (1976) 
(addressing the needs of government agencies in five states). A diagramatic 
presentation of this user pyramid is given in Figure 1 in which the relationships 
between types, numbers and sophistication of users are presented. 

A second graphical presentation of users is shown in Figure 2 in relation to 
types of data supplied by NASA and the likelihood of these users to engage in re- 
search and development, or only in operational use of satellite data. Federal 
agencies are largely responsible for much of the present research either In-house 
or through funding of universities and consulting companies. Large corporations 
also take a modest share of the research with their own internal research funds. 

It is noticeable that little state funding is given to research in this area (see 
Simonett, 1976) and that virtually no local government research funding is avail- 
able. However, some development has to be carried out in some measure by all 
users except to the degree that the federal government acts in loco parentis for 
the smaller and less sophisticated users. What is important in the diagram is 
that operational use of fully processed satellite data greatly expands the number 
of users other than the federal agencies. The latter may constitute less than 25 
percent of total users. Also summarized in this diagram is the requirement for 
fully processed data to facilitate the move to operational use of satellites for 
all users. 

In the Eastwood et al . study (1976) on state agency uses of remote sensing 
they found that for the most part the data demanded by these users is easily 
mappable onto common base maps, such as the 1:24,000 U.S. Geological Survey quad- 
rangles. Second, they found that the Landsat CCTs presently available required 
a degree of analysis and research beyond the capabilities and means of most of 
the agencies. Third, net benefits would result were the agencies able to obtain 
satellite data in a preprocessed form from NASA or other federal agencies. 

Problems of Cost-Benefit Studies. Since 1965, considerable research and 
funding has been devoted to the National Aeronautics and Space Administration, 
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: he Department of the Interior and other agencies, to estimating the costs and 
enefits of various earth resource survey systems J Some of these cost-benefit 
: tudies have examined a number of applications, such as land use planning, flood 
arning, and mineral exploration, while others have focused upon a single func- 
ion such as improved crop forecasting. A few studies have attempted to compare 
he economic merits of alternative data collection modes, e.g., aircraft vs, 
atellite based observation platforms, but most have concentrated attention only on 
atelllte systems. ^ Despite considerable ingenuity and effort, many observers 
tave expressed misgivings over the figures, particularly on benefits, obtained in 
hese studies. 3 That these misgivings are well founded is demonstrated by the 
»ery wide range of benefit estimates found in various studies for similar cate- 
lories of use,^ and the resort to cost-effectiveness studies— as In the case of 
» 1974 study for the Department of the Interior— because of the hesitation in 
Precasting future uses of Imperfectly defined future products. Moreover, with few 
jxceptions, these studies have not directly addressed the question of how satel- 
lite data and information should be processed and transmitted in usable form to 
final users. 

Space Image Data, Cost-Benefit Studies and Public Goods . In large measure, 
these problems concerning the outcome of cost-benefit studies can be directly 
traced to the nature of the product studied, which is the data (or information) 
that various final users employ in their decision-making processes. Benefits 
that individual users obtain from the data produced are typically viewed In terms 
of derived demands for information as an Input to planning and decision-making 
activities. Thus, the value of the input is derived from the value of the final 
product or service produced by the user. Krzyczkowski , et al, (1971) provide a 
three-way classification that is useful in analyzing the nature of benefits of 
various types of information. According to their scheme, data obtained from re- 
mote sensing will either "yield (1) the same information as now exists, (2) better 
information than now exists, or (3) new information. 1,5 For information in the 
first category, the task of estimating benefits is simplified since it is reason- 
able to assume that the value of such ds^a exceeds current cost; thus, the 
potential benefit attrlbutat ie to a new data source is fully reflected in a re- 
duction in the cost of obtaining it. The latter two categories of information are 
more problematic. Logically, one must describe the nature of the "better" or 
"new" data, hypothesize the way in which it will alter decisions, and then attrib- 
ute benefits to the gains accruing from better-informed decision-making.® This 
sequence of logical steps is spelled out clearly in Zissis, et al. (1972, p. 82), 
in Kryczkowski, et al. (1971, p. 33), and in Earth Satellite Corporation-Booz- 
Allen (1974, Vol. 5, pp. 13-17). 
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n The last two steps In this chain of reasoning are conceptually the most 

difficult. The second requires a well developed model of the decision process 
| and its underlying goals. The third is greatly simplified if the decision- 

■ making activity pertains to a product exchanged in ordinary markets, where 

! prices can be directly observed. Given this, it is not surprising that the best 
developed methodology for estimating such benefits lies in the area of Improved 
crop forecasts. Since the farmers, processors, speculators, etc., involved in 
agriculture are private entrepreneurs, the profit motive provides a natural goal 
and characterization of decision-making. The existence of well organized markets 
for agrieui tural products implies a ready source of data on relevant market values. 

As Stoney {13?/, pp. 7-8) points out, however, this is the only area where formal 
economic models have been systematically employed. Moreover, he notes that benefits 
to the resource exploration industry, and contributions to state and local govern- 
ment may be "doomed to be Ignored because their needs have yet to be expressed in 
precise mathematical terms. In general, when applied analysts have confronted 
the question of benefits derived from actions of public agencies, or from better 
management of non-market resources, they have been forced either to rely on 
educated guesses or to term such benefits unquantifiable. 

All applications of benefit cost analysis characterize benefits as willing- 
ness to pay for the services delivered. Underlying the benefit-cost criterion 
is the well known compensation principle; if benefits exceed costs, then the 
stream of receipts could potentially be distributed among individuals so that no 
person is made worse off, and at least some gain as a result of the project. The 
presumed reason for relying upon government provision rather than simply selling 
such items to final consumers at prices that co ver cost is that the commodities 
in question exhibit public good attributes which present difficulties for any 
market-like distribution system based upon voluntary exchange. 

The variety of public information gathering services in the U.S., including 
the collection and processing of data obtained from aircraft or satellite 
platforms, are natural examples of public goods. In what is probably the most im- 
portant of such activities, the federal government collects a wealth of socio- 
economic information through various censuses; once collected, the use of this 
information by one party does not make it less available for others. Much the same 
is true of data or information collected by remote sensing techniques. True, the 
physical documents in which this information is transmitted (e.g., images, data 
tapes, etc.) are private goods, in that one person's use at least partially pre- 
cludes use by others. But, the information contained in these documents is not 
subject to this sort of rivalness in consumption. These same remarks apply, per- 
haps with even more force, to the software that may be used to analyze the data. 
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Public goods in particular, and items generating consumption externalities In 
general , are capable tff yielding benefits that may be simultaneously enjoyed by 
several users. Efficient provision of such commodities requires knowledge of the 
. demands of all consumers simultaneously. If efficient provision is to arise from 
competitive behavior, then different agents must face different prices for such 
items (see Groves and Loeb, 1975, pp. 211-212). An "incentive problem" arises 
because these individualized prices are directly dependent upon the preferences 
and valuations of individual agents, and these agents are not generally motivated 
to reveal them accurately. 

Demand Revealing Processes . Until recently, it was widely agreed that any 
attempt to base provision levels and payment shares upon reported demands for 
public goods will automatically create incentives to misrepresent preferences. In 
the last decade, however, a class of pricing mechanisms has been identified which 
simultaneously identifies the efficient level of public good provision and solves 
the problem of extracting payments from users in order to cover costs. All of 
the mechanisms discovered to date are isomorphic up to a system of fixed charges 
levied upon Individuals (fixed in the sense that one agent's charge does not depend 
on the behavior he pursues). Each mechanism Is characterized by a transfer, to 
each Individual , of the full reported net benefits obtained by all other users of 
the public good. 8 Intuitively, the latter feature induces each party to take 
' into account the benefit obtained by other consumers when reporting his valuation 
of the service. 

To ‘illustrate, let i=l . . . N be an index of users of the public good, and let Q 
' be the set of possible public consumption levels. Efficient provision of the 
public good occurs at q*, which solves 

N. /IX 

max l B. (q) - C(q) 1 U) 

qeQ i=l 1 ' 

where Bj(q) is a function relating Individual i's true benefit to the level of 
the public good provided (q), and C(q) is a total cost function. The set of 
alternatives may either be continuous or discrete, though in the present context 
a discrete set of choices seems more realistic. To Introduce the proposed 
pricing mechanism, consider the following allocation and cost distribution rule. 
Each individual will report, to a central agency, a total benefit function (or, 
alternatively, a demand function) for the service in question. In the case of a 
discrete set of alternatives, this would merely be a schedule of the individual's 
maximum willingness to pay for each of the alternatives under consideration. The 
"center" will process these messages and produce the level of service that max- 
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imizes the total of reported net benefits. In addition, it will transfer to indi- 
vidual j the following amount: 

t B.(q) - C(q) - Aj. 

m 

B^(q) is the reported benefit function of individual i, A. is j's lump sum charge, 
and both of these are assumed to be independent of the function that j reveals. 

From (2) it is clear that the transfer may be positive or negative (i.e., a charge) 

depending upon the magnitude of A*. In most specifications, the fixed charges (A.) 

J J 

are sufficiently large to make the transfer negative for most consumers. 

Individual j's total reward consists of his personal benefit, B. (q), plus 

J 

the transfer in (2). Since the center selects q to maximize total reported net be- 
nefits, j's total reward will be maximized only if he reports his true benefit func- 
tion to the center. To induce honest reporting of benefits by alX parties, the same 
type of transfer offered to j is made available to all_ individuals in the group. 

Two features of this general class of Demand Revealing Processes are note- 

g 

worthy . First, even though a particular user's public good consumption level and 
cost share depend upon the benefits reported by all agents, the individual's best 
strategy is to reveal his true benefit function regardless of the behavior of others. 
This dominance property is stronger than that which exists in an ordinary Nash equi- 
librium, where honest reporting is the best strategy only if all others report hon- 
estly as well. Further, given the structure of incentives, the center can quite 
naturally assuem that all individuals are reporting true benefit schedules; it has 
no particular reason to attempt to estimate benefits itself. Individual mistakes 
and inaccuracies may of course arise, but unless there is some compelling reason to 
believe that the center can avoid these more efficiently than the individual user 
can, the center is justified in taking the reported benefit schedule as datum. 

In the absence of lump sum charges, any central agency using this pricing 
scheme would obviously encounter a deficit. It is to solve this apparent problem 
that systems of fixed charges have been proposed; actually, these charges are not 
an integral part of the incentive structure embodied in the pricing mechanism. Among 
the charge systems proposed by various authors, the one developed by Groves and Loeb 
(1975) has two attractive features. First, it guarantees that the center will not 
run a deficit; i.e., in sum, the "transfers" made to individuals will be negative 
and sufficient to defray the cost of provision. Second, even though payments by in- 
dividuals fully cover costs, each party is assured some non-negative benefit from 
the outcome; no individual will be made worse off as a result of the allocation pro- 
cess described here. 
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The Groves-Loev system of fixed charges is parameterized by two fJ dimensional 
vectors q^. . . ,q N which represents the amount of the good each party would have 
provided in the absence of coordination, and , a set of signed "cost 

shares" that sum to unity. To ease exposition, assume that the per unit cost of 
the item is a constant, c. Given these definitions, the Groves-Loeb fixed charge 
for Individual j is 


c<L - t,cq + max 
3 3 qcQ 


i 

i * 

Jj 


t Bj(q) - (1 - tjcq 

m 1 3 


q 


(3) 


where q = q^ , the aggregate amount of the item provided in the absence of coordi- 

nation. It Is straightforward to show that such a system of charges yields a surplus 
(or, possibly, budget balance) for the center. Combining (2) and (3), the transfer 
to party j Is 

- max I z B. (q) - (1-t 
qeQ lift 

+ tj(cq - cq*) - cq^ 


T . = 


2 Mq 
1 


*) - (l-tj)cq 


★ 



where q* is the level of provision chosen by the center. In (4), the sum of the two 
terms in brackets is clearly non-positive, for j=l...N. Summing the remaining terms 
over all individuals, to characterize the total transfer (T), yields 


N N „ 

T < Z t.(cq - cq*) - E cq. 
“j=l J j=l J 

N 

or T < - cq* , since E t. - 1 

j=l J 


(5) 


N 

and E cq- = cq . 


It is somewhat more difficult to demonstrate that a suitable set of positive cost 
shares exists to satisfy the second claim, that all individuals gain from the system 
as compared with decentralized voluntary provision of service. Though existence can 
be shown, general formulae for computing such cost shares have not yet been developed. 
However, Groves and Loeb (1975, pp. 224-225) discuss how the problem might be ap- 
proached in practice. 

In the context of choice among a finite number of alternatives, the system of 
charges is less complex than it might appear from (4). As the number of agents in- 
volved in the pricing scheme (N) increases, the probability diminishes that a given 
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individual's reported benefit will alter the center's choice among alternatives. 

Thus, it is likely that for most agents the two terms in brackets in (4) will be 
equal (i.e., their dollar votes won't alter the outcome) in which case their charge 
would be correspondingly simplified. The system of charges can be further simpli- 
fied if one is willing to forego either of the two features possessed by the Groves- 
Loeb scheme. The lump sum fee system proposed by Clarke (1971) drops the first two 
terms in (3) so that the transfer in (4) merely becomes the two terms in brackets 
minus t-cq*. It is obvious that this system insures a non-negative surplus for the 

J 

center, but it can no longer be guaranteed that all parties will gain when the sys- 
tem is introduced. The latter feature may well be important if membership in the 
bidding group is voluntary.^ 0 

The Application of Demand Revealing Processes to Satellite Data and Software 
Products . In the general field of earth resource survey systems, there appear to be 
a number of specific areas in which these pricing mechanisms could be employed. One 
obvious application is where the items supplied to users include software routines 
(for further data processing) which possess public good attributes. Likewise, if a 
number of users desire information on a particular area, the processing activity it- 
self yields joint consumption benefits. In both cases, a major portion of the cost 
of producing the final product is largely independent of the number of final users 
employing it. Consider, for example, the development of software and suppose that 
three alternative levels of processing designated X, Y, Z, which perhaps represent 
increasing refinement of the raw data, are to be examined. If it is known, before- 
hand, that only one level of processing will be undertaken, the list of choices pre- 
sented to potential users may simply be X, Y, Z. Of course, one of these options 
may involve no processing at all, i.e., the raw data. To allow participants in the 
process to form accurate valuations, it would be necessary to describe the general 
characteristics of the data obtained under various alternatives. Once they have 
gained familiarity with the nature of the choices, each would be asked to submit a 
list of bids representing their agency's maximum willingness to pay for each alter- 
native, with the understanding that the choice would be made and cost shares assigned 
in the manner descirbed earlier. Typically, one would not expect the group of poten- 
tial users to be homogeneous; it may be a mixture of private firms and representatives 
of local government agencies. Likewise, the users to which they put the data may 
vary, but this is of no consequence. 

If the possibility exists that more than one type of processing will be adopted, 
(e.g., X and Y) then the list of alternatives must be extended. This is essentially 
due to the fact that Demand Revealing Processes, as presently designed, select only a 
single alternative from the list. If all possible combinations are allowed, the set 
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of choices must include XY, (X and Y), XZ, YZ and XYZ; i.e., a set of mutually ex- 
clusive options that collectively exhaust the range of choices under consideration. 

If a given user finds that two fo the simple options, say X and Y, are complementary 
or substitutable for one another, then his reported benefit for the combination (XY) 
would differ from the sum of reported benefits for the two simple options. 

Conceptually, the type of process described here could be applied to other 
decisions involved in providing data to final users. For example, it might find use 
in deciding whether or not to add an extra item of hardware to a proposed aircraft 
or satellite based platform. More generally, one can conceive of its use in select- 
ing the overall configuration of instruments on future platforms. The reason for 
centering the preceding discussion around the question of processing and the final 
form data will take is that this is an area where very little information exists on 
benefits. Received benefit cost studies have largely neglected these questions; 
implicitly, it has been assumed that final consumers will find that the data fits 
their needs. 

To the extend that local agencies will be among the set of demanders (e.g., 
for land use planning, watershed management, traffic system design, etc.) the bene- 
fits they report will, in all likelihood, be voiced through political representatives, 
and benefit figures will be colored by the political process. Although representa- 
tive democracy is only an imperfect device for registering the preferences of the 
citizenry, decision making by elected officials is an integral part of the tradi- 
tional political structure. Further, "representative reporting" seems clearly pre- 
ferable to the current practice of denoting benefits from such non-market uses as 
"unquantifiable" , or of ignoring benefits entirely and falling back to cost effec- 
tiveness studies, a course shich many cost/benefit studies have in fact followed. 

If confronted with such a decision making algorithm, local public agencies may well 
find It in their interest to conduct their own benefit-cost analyses for projects 
under consideration. In this case, the benefit figures reported would be measured 
in terms of the actual decision and action the agency would take as a result of the 
new information, and would not reflect some hypothetical optimum course of action 
postulated by an independent benefit cost analysis. 

Problems in Applying Demand Revealing Processes . One could expect to encounter 
a variety of practical problems in ar.y attempt to apply the allocation mechanisms 
described here. In particular, the formulae for computing individual payments appear 
complex. Actually, this difficulty may be more apparent than real. Attempts to im- 
plement Demand Revealing Processes in experimental situations (Babb and Scherr, 1975, 
and Smith, 1976) have found the subject quite capable of making choices, once the 
mechanics of the pricing system were explained. Perhaps the only Don-experimental 
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attempt to employ a pricing system for allocating public goods was the Public Broad- 
casting System's Station Program Cooperative (SPC) (Ferejohn and Noll, 1976). The 
product involved is similar to that provided by remote sensing Information systems 
in that the total cost of producing and transmitting a particular television program 
is only slightly dependent upon the number of stations at which it is received and 
broadcasted. In the SPC case, program descriptions or pilots were distributed to 
individual stations for evaluation. The stations were then assigned prices (which 
differed by station) for various programs, and were asked to indicate which programs 
they desired to purchase at assigned prices. The central coordinator then raised 
prices up or down, depending upon initial responses, in order to cover total costs. 
The process was repeated, iteratively, on the basis of updated prices. Unlike the 
Demand Revealing Process, the SPC mechanism was not carefully designed to promote an- 
efficient structure of incentives; in fact, there was no guarantee that it would even 
converge to an equilibrium, although it did when implemented. The important point, 
however, is that individuals involved in the process were able to reach decisions 
and register choices when confronted with a rather complex allocation algorithm. 

An important conceptual difficulty surrounds the composition of the group in- 
volved in the pricing process. Within the static context of models developed to 
date, it is assumed that the membership of the group of potential users is known be- 
forehand. Thus, questions of entry and exit by individuals has not been addressed. 

By quaranteeing some non-negative benefit to each menber, the Groves-Loeb system 
partially avoids the problem of exit. However, the question of how to proceed if 
agents, initially omitted from the group, wish to join after the allocation decision 
has been made deserves further study. This may be particularly important if the item 
produced is "non-exclusive," in which case incentives to dissemble over desires to 
join the group may arise. A full treatment of these issues would require a dynamic 
treatment of these processes. 

Given these and other potential difficulties, 11 together with the novelty of 
the approach, it would be premature to suggest that Demand Revealing Processes should 
supplant existing decision making institutions in the area of remote sensing infor- 
mation system. It seems appropriate, however, to recommend continued study, and 
perhaps trial experiments, in remote sensing applications. 

An Investment Devision . The application of Demand Revealing Processes (DRP's) 
in the case of local and state government users appears to possess several useful 
attributes. First, the centralized provision of data and software should offer the 
maximal returns to scale available, yet the use of DRP's implies a high degree of 
flexibility with respect to meeting user demands. Second, such a system could be 
implemented within the present government structure, and might be viewed as an inter- 
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mediate step towards “establishing the type of distribution envisaged by Eastwood, 
et al. (1976), Third, end in -relation to the prior point, it may be viewed as a 
" seeding 1 1 mechanism whereby more and more potential users come to adopt the use of 
satellite data. This last point is now analyzed in somewhat more detail. 

Two of the cost-benefit studies {The Seasat Study by Econ, and the Earth 
Resource Survey Cost Benefit Study by Earth Satellite Corporation— Booz- Allen Ap- 
plied Research) both mention the adoption 'process by which a new technology diffuses 
throughout a society, although both stop short of any analytical treatments. A 
common analytical approach to the problem of technology transfer is by use of the 
logistic equation (see Montroll, 1974) 

d£=»P (P-P) (6) 


where p is the proportion of potential users who have adopted the innovation at 
time t, o is a rate constant and P is the total number of potential users. While 
this equation has been found to be an adequate model of many diverse systems, one 
of its main points of Interest for the current problem is that the logistic equation 
has three parameters that have useful Interpretations in terms of policy decisions. 
The three parameters are the initial level of adoption p Q , the rate constant o and 
the saturation level P. The initial level of adoption parameter may in fact be 
viewed as a seeding parameter, and gives rise to the following important implica- 
tion. If one assumes that a and P are fixed in value, the higher the initial level 
of seeding, P Q , the sooner a fixed percentage of the potential adopters come to a- 

dopt the new technology and the earlier the aggregate system comes to enjoy the ben- 
efits of the technology. Hence, one may consider the problem of finding an optimum 
levfel of seeding. 

In order to solve this problem, it is first necessary to define the net present 
value (NPV) of the stream of benefits arising from the seeded adoption process. If 
E(p q ) is the cost of seeding at an initial level p Q , and R(p(t)) is the instantaneous 
rate of flow of benefits from having a level of adoption p^ft) JP at any time t, 
then 


NPV = /“ R(p(t)) e -lt dt - E(p 0 ) 


■ ai daAP IS 


(7) 


where 1 is the relevant discount rate. 

Evidently, the level of seeding p Q that maximizes the net present value of 
benefits is either zero (a "corner" solution) or some value 0 < p 0 <_ P. In the lat- 




ter case, the optimal degree of seeding may be found by taking the derivative of NPV 
with respect to p Q , setting the result equal to zero, and solving 


dE _ f V iT dR 

3p 0 ' 1 as, 


(p(r))dT 


(8) 


for p Q . Some insight Into the solution to this problem may be obtained by assuming 
that the adoption process Is described by the logistic equation (6). Hence, the 
number of adopters p(t) at any ';ime t » 0 is given by 


p(t) = 


P 



-aPt 


(9) 


where = P 0 /(P-P 0 ) depends on the initial conditions. One may then substitute (9) 
into (8). However, it is not difficult to show that 


dR = 1 din . dR 

dp 0 aP ^Po dt 


( 10 ) 


and it follows upon integrating (8) by parts that 


1/i 


dE 

dPo 


J_ dlnK 

aP * dp 0 


" [«*)) - R(p 




dt 


(id 


But since 


1 din _ 1 

aP dp 0 " ap 0 (P-P 0 ) 


(12) 


one may write the result as 

1/1 ’ ip 0 at = f Q J^(p( t )) “ R(P 0 )]e- dt 
t=o 
P=P 0 


(13) 


For the present case, this equation determining the optimal level of seeding 
? Q has the following useful interpretation. The right hand side of (13) may be viewed 
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as the present value of the stream of benefits from the adoption process that are 
additional to those benefits if the level of adoption were held at the seeding level 
p Q . The left hand side of (13) may be interpreted as the present value of a perpe- 
tual stream of marginal costs, where the marginal cost relates to the additional 
cost per unit time initially incurred by using a seeding level p Q . Therefore* when 
the level of seeding p Q is chosen in an optimal manner, the two expressions are 
equal . 

The importance of these results for the present study follows from two facts. 
First, it has been shown that the optimal stream of net benefits from the adoption 
of satellite data use by local government agencies Is sensitive to the "initial" 
number of adopters. If this initial' number is small relative to P and if the mar- 
ginal costs of seeding are sufficiently small, then the system is losing net bene- 
fits and it would pay to encourage more users by means of Investing in a seeding 
program. Second, one may interpret the provision of satellite data and software by 
a centralized agency as a seeding process. As such* there should be an optimal 
level of provision that would result in a maximal present value of net benefits. 
Furthermore, the provision of data and software, rather than direct subsidies to 
users to encourage use as previously suggested by Simonett (1976), should prevent 
local duplication of effort. The importance of the DRP's in such a process is that 
not only could they be used to obtain information on user demand, and hence on pos- 
sible pricing sturcturas, but they could also provide Information on benefits a- 
ri sing from the use of satellite data that would indicate an optimal level of seed- 
ing. Hence, one may view the overall process involving DRP's and the associated 
provision of data and software as a process whereby optimal system benefits are ob- 
tained. 

While it is not suggested that the process of adoption of satellite data pro- 
cessing by local government agencies v/ill necessarily follow the logistic law, nor 
that the task of calibrating an approximate model in terms of p Q ,a and P is easy, 
it is maintained that there are benefits to be had from thinking in terms of such a 
model. A further point of interest is that the ultimate saturation level may even 
be positively related to the Initial level of seeding (P = P(p Q ), p,> 0) in which 
case the argument for seeding becomes even stronger. 

It seems clear that NASA could take a central role in exploring the possibil- 
ities of seeding the system by using Demand Revealing Processes to implement a cen- 
tralized system of data and software provision. Such a role could be temporary 
(since seeding is a "temporary" process), until a system, such as that envisaged by 
Eastwood, et al. (1976) were implemented. Alternatively it could persist in parallel 
to some degree with such a system. 
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SUMMARY 


NASA, or some central government agency, should apply the use of Demand 
Revealing Processes in order to determine the true demands of state and local 
jjvernments for satellite data and software products in a continually updated 
manner. Such processes would allow varying degrees of cost recovery in rela- 
tion to supplying various mixes of data and software, while users would face 
charges directly related to their "true" willingness to pay, despite the public 
goods nature of such satellite products. 

While a portion of the preprocessing costs should be absorbed by the federal 
government agency, the application of the Demand Revealing Processes and the pro- 
vision of various combinations of data and software should be viewed as an opti- 
mal investment decision, whereby the present-value of the net benefits of satel- 
lite data use is maximized. 


Footnotes 


1. Several of these studies are surveyed in Krzyczkowskl , et al. (1971) 
and In Cralb and Watkins (1977). 

2. An- example of comparative analysis Is Cralb (1977). 

3. See the discussion by Hart (1977, pp. 357,358). 

4. Recent estimates of the benefits due to Improved crop Inventory figures 
range between $.7 million to $174 million per year; see Ray and Keith 
(1977). 

5. Krzyczkowski , et al. (1971, p. 14). 

6. At- the same time, these two categories may be the most Important for 
applications of remote sensing technology. In their review of past 
benefit cost studies, Krzyczkowskl, et al. estimate that about 83% of 
"valid benefit estimates for the U.S." are attributable to better Infor- 
mation and 13% accrue to new Information. Only about 4% are represented 
by Information available now. 

7. William E. Stoney, (1977, pp. 8-9). 

8. The class of mechanisms discussed below was Independently put forth by 
Vickery (1961), Clark (1971), and Groves (1973). Relationships between 
Individual pricing schemes are discussed In Loeb (1977). 

9. For elaboration on these and other related points, see Groves and Loeb 
(1376, p. 216). 

10. A number of technical shortcomings of these mechanisms have been pointed 
out (Groves and Ledyard, 1977). If Individual benefit functions contain 
Income effects, the assumption that one individual's benefit function is 
Independent of the schedules reported by others is violated. Further, as 
with other collective choice institutions. Demand Revealing Processes are 
not immune to manipulation by coalitions. Though little empirical evidence 
has been presented, Tideman and Tullock (1976) argue persuasively that such 
problems are likely to be of small practical importance. However, in a large 
number of situations there is little likelihood that one individual's res- 
ponse will alter the outcome; correspondingly it may not "pay" the indi- 
vidual to expend the resources or introspective effort required to report 

an accurate benefit function. 

11. Under existing practices, a number of federal agencies provide information 
to the general public without charge (except possibly for distribution). 

Thus, potential users have no unsatisfied demand for the types of infor- 
mation available and attempts to obtain demand schedules for such data 
would be fruitless unless the present system were changed. In the absence 
of change, utilization of Demand Revealing Processes would appear to be con- 
fined to categories of information not currently provided. 
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APPENDIX D 


OATA COMMUNICATIONS SYSTEM 


{Modified from material provided by John R. Roberts after the conference) 


In order to achieve the goal of full sharing and management of earth 
resources data for users throughout the country, the role of data communica- 
tions must not be overlooked. The attributes, functions. Implementations, 
and future considerations of a potential Data Communications System are 
discussed below. It Is recognized that this capability already exists for 
federal agencies „ but It should be expanded to Include all users. 

Attributes 

The following are key attributes of the Data Communications System 
which would enhance the utility of earth resources data for users; 

1. Accessibility. Using a terminal, the customer would dial 
government-provided inward WATS (area code 800) lines to access 
integrated data base indexes. It may be necessary that a 
terminal be government- furnished to qualified users. 

2. Versatility. The customers may scan for data availability at 
their own pace, choose among established formats for their 
desired products, and select standard preprocessing options. 

3. Response Time Reduced. Orders could be placed from the terminal 
during the same session, rather than mailed, if suitable security 
measures are included. If not, orders could be placed by phone 
to EDC. 

4. Timeliness. If the earth resources data source agency (e.g., 

Eros Data Center) could update the data base index, the user 
could learn the availability of the latest data faster than 
presently. 

5. Correctness. Manual handling of the order prior to entry into 
a computer system is reduced. 

Data Management Functions 

The Data Communication System would enable the users to search the data 
base (Index), place orders for retrieval of data (products), and would advise 
him of billing and accounting matters. A properly designed, user-oriented 
system could provide at least the three data management functions, described 
below, in an easy-to-use, efficient manner: 

1. Search Data Base Index. The customer Inputs or indicates 
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established search parameters, such as: 

a. location by latitude/longitude of center point or corners 
or by place name (or UTM? ) ; 

b. quality, cloud cover percentage; 

c. time of year or season; 

d. spectral bands; and 

e. cartographic overlays. 

The system would provide responses in terms of product avail- 
ability (images or CCTs or HDDT's and supporting software as 
applicable), costs of services, and delivery times. Also, 
other options, such as established format selection (for tapes), 
or preprocessing could be set forth. Further, standing queries 
could be established and activated by a simple call procedure. 

2. Order for Retrieval. The customer may then request products 
which he feels suit his needs, using the same parameters as 
above, or additional options, such as scale, projections, 
alternative formats and preprocessing. Products are sent by 
mail . 

3. Billing and Accounting. Established users would have their 
own accounts, and could query their status and request billing. 


Implementation 


There are a few implementation courses to be considered. For 
instance, a central system, with redundant host computers, multi-access 
central data base, and communications (front-end) processors could be a 
viable solution. However, there may be better ways to serve the user 
community, particularly if possible overloading of such a facility were a 
concern. Another approach would be to provide regional centers to handle 
the data base index queries, order processing, and accounting. There would 
still be a central data base of remotely sensed data, with which each of the 
regional centers would have a high speed communications link for order 
placement. Reformatting and preprocessing, if required could still be 
done centrally. A further refinement of this approach would be to pro- 
vide capability at each regional center to store that region's most recently 
acquired data which would be turned over to the archival data base after 
passage of a set amount of time to make room for more recent data. Such 
capabilities as those above are within the state of the art. 

Whatever approach is taken, a prime consideration must be that all 
matters which relate to the interface presented to the user must be standard- 
ized and remain consistent in their growth. Such matters would include 
data formats, communications protocols, media formats, data structures, 
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user-oriented procedures, and data management services. Reference 1 is 
a plan for an earth resources data management and data sharing approach, 
which involves the use of the ARPANET and inward WATS lines. Thi plan. 

If implemented, would make available earth resources services, not only 
from five NASA Centers and the EROS Data Center, but from any such re- 
source connected to the ARPANET. This plan also addresses the problem of 
data formats and structures standardization and presents cost and schedule 
estimates. 

Future Considerations 

So far, the transfer of actual image data over communications lines 
has not been proposed here. For most present users, the use of the mail 
is fast enough. However, given the ever-increasing local capabilities of 
image processing terminals and the lower costs of larger memory subsystems, 
coupled with unforseen future urgencies (earthquake prediction?), such 
capabilities shoi'ld be considered for future applications. This may be 
accomplished by means of satellite communications systems, judicious use 
of change data approaches, clever data compression techniques or combina- 
tions thereof. Useful image segments could be transmitted over 50 Kbps 
lines in less than one minute. Within localized complexes (buildings) 
of large organizations a shared, video bus technique is quite promising 
for rapid transfer of high volumes of data. 

Another consideration is that as the Data Communications System 
approach described above matures, other geographical data bases could be 
incorporated with conversion to standard formats, to better serve the user 
and start the process toward fully integrated data bases in this field 
The user could experiment with their usage of this data, leading to a 
refinement of their requirements for the national data base. 
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APPENDIX E 

Background Material Presented to Conference Attendees 

(The following two essays were prepared before the conference by F.C. 
Billingsley and distributed to the attendees upon arrival. While they do not 
necessarily represent the conclusions of the workshop, they were instrumental 
in providing an initial focus to the problems later addressed by the confer- 
ence. ) 

TUFTEAUF* 
by F. C. Billingsley 

THE PROBLEM 

The number of "universal" tape formats are in existence. None of these 
have all of the required data for completely understanding the tape contents 
to the extent of being able to read the tapes and find all of the data. Most 
of these have most of the required information, but none has all. None is 
completely satisfactory as proposed. In addition, most have a mixture of in- 
formation in the headers relating both to the format and to the data itself. 
The latter makes the headers unduly complex and long. The headers are dif- 
ferent for each application and cannot be brought under format control with- 
out being extrfemely complex (to cover everything needed). 


What is considered here is a format philosophy which is aimed at accomplish- 
ing the following: 

* Provide a (viaeo tape) data interchange format which will allow the local 
user to read the tapes to determine such factors as the basic logical 
structure of the data on the tape 

* Provide a structure which is expandable, to allow the inclusion of all 
necessary ancillary data, without requiring extensive unused space 

* Provide a structure which will handle, in the same logical format, either 

cellular data (integer, real, or complex, or logical), numberical lists 
(such as geographical polygon lists), and alphameric lists (such as nominal 
data files). Inclusion of the list data causes no additional problems, 
since a list may be considered to be a line of data. These lists are of 
crucial importance within geographical data systems, as much external data 
is in this form 


* The Universal Format to End All Universal Formats 
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* Provide a structure which is useable within a local data system as well as 

for interchange* thus possibly minimizing reformatting problems for the 
users. 

SO WHAT? 

A format philosophy can be defined to hold all of the above data types. A 
polygon or nominal data list is logically equivalent to an image line. All of 
the data types require the same types of ancillary data for use: geographical 

location* scale, identifiers, date of generation, source, specific annotations, 
etc. All may be interleaved in the same ways: several spectral bands or poly- 
gon lists may be interleaved, or N or 1/N lines may be packed per logical record, 
etc. 

Polygon data may be converted to cellular form for use, to allow it to be 
used with continuum cellular data such as imagery or continuum altitude data. 
Therefore commonality of the ancillary data format is important. (Conversion 
of continuum data to be polygon form Is .usually not convenient, as that requires 
converting the continuum to a series of steps. Outlining of the various homo- 
graphic areas, and converting to polygon lists.) 

Since one or more data sets may be closely related (e.g. , multiple spectral 
bands, or stereo pairs) it is desirable to be able to reference them together 
under one directory. 

Some of the ancillary data may refer to the entire data set, and other may 
vary line-by-line. Both Types are needed. Both types may shrink to zero; the 
quantity of each must be stated. 

Information required to read the scene must be in a fixed format scene direc- 
tory. (Format number and revision, number and size of lines/logical record, number 
of data sets In the group, bytes/data unit (e.g., pixel), data packing plan 
(e.g., spectral interleaving), data type, etc.). The scene directory is the 
only record requi red to be present. 

Data set specific data may be in ancillary records. These have been found 
useful to be either the size of the directory record or the size of the data 
records. Both should be possible. 

Some projects may define that a given type of record (e.g., a fixed header 
as the first parameter record) will always be present. This is within the above 
structure, and is part of the project-specific format definition. 

Two basic data structures have been used. Both have advantages and disad- 
vantages. Either should be possible. 
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During operation, the system reads the Directory to ascertain data structure fol- 
lowing, and refers to the operating instructions (the execute command) for further 
instructions. 

The quantity of all types of records (except the directory) may be changed 
during operation, and the length and number of SPs and data may be altered. Direc- 
tory entries are modified and updated to match. 

The entire assembly may be preceeded by a Tape Directory (TO) if desired. 

The first 6 bytes are to be reserved for record ID and line count. 

For a (Cellular) data it has been found desirable to allow the cellular 
data itself to be surrounded by a border (containing, for instance, tic marks, 
grey scales, human-readable annotations, in cellular form) which would be treated 
by a film recorder as part of the image, and having each line preceeded by a left 
annotation (LA) group of bytes and followed by a right annotation (RA) group of 
bytes. LA and RA provide locations for line-specific data. 
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such as Image line number. It has also been found desirable to count cells by 
number of cell lines (LV ) and number of cells/llne (CV), even though each cell may 
contain multi-byte data. For consistency, LB and RB are also to be counted by cells, 
with the expectation that the number of bytes/cells Is the same as for CV. La and RA 
are normally alpha-meric, and are counted In bytes. 

It has also been found desirable to be able to pack P lines of data per logi- 
cal record or allow I/P lines/record. 

Because of external considerations and to allow sufficient room for additional 
entries to be defined, it is suggested that the TO directory be 360 bytes long. For 
the same reason. It Is suggested that the scene directory SD also be 360 bytes. 

NASA data transfer standard Is ASCII. This should be adhered to for all al- 
phabetic and numerical annotation unless specifically coded. Binary should be used 
for video data and (perhaps) record number. 

With these definitions, it is possible to define a Tape Directory (TO) and a 
Scene Directory (SD). 

TAPE DIRECTORY (TD) INFORMATION TO READ THE TAPE 

An isolated file consisting (normally) of one record. Since the tape set may 
contain more than one image, from several sources, no specific image data to be in- 
cluded except (optional) a set of Image names. TD applies to a set of tapes, each 
having a volume number. It Is logically possible to have more than one logical tape 
per physical tape. TD is fixed length (360 bytes) with fixed field locations. 
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Byte loc 

Field Length 

Field Definition 

1 - 4 

4 

Record # (starts at 1 after EOF) 

5,6 

2 

Record Type Code (0118 for TD record) 


4 

Record Length, this record (0360 for TD) 


20 

TD Format Control # and Revision # 


1 

Tape Volume # 


1 

Number of Volumes In set 


8 

Tape ID at User Location 


8 

Generation Location, this Tape 

Suggested: 

12 

Generation Date, This Tape YYMMDDHHMMSS 

-180 


Reserved for Future Format Control 

181-360 

180 

Permanently Uncol trolled - For Local Use 


b (blank) 

Not defined or not yet filled 


0 (zero) 

A number. Do not use zero fill for fields 
which later are to contain numerical data. 

SCENE DIRECTORY (SD) 

INFORMATION TO READ THE SCENE 


Applies to a set of logical scenes (LS) following and their parameter records. 
Each scene in the set must have the same structure. Each LS may have several inter- 
leaved/concatenated individual images. The true video data may be surrounded by 
borders. SO is fixed length (360 bytes) with fixed field locations. SD is normally 
one record, but may logically contain several records, all of the same length. 


SUGGESTED FIELD LENGTHS AND GROUPINGS ARE AS FOLLOWS : 

LENGTH 


FORMAT DEFINITION Bytes 

Record # (starts at 1 after EOF) 6 

Record Type Code (0228 for SD Record) 2 

SD Format Control # and Revision # 20 

Format Control # and Revision # for bytes 

257-360 of SD and for SPs and video 20 

First Byte of Uncontrolled Field (FBU) 4 

SCENE IDENTIFICATION 

Scene ID 20 

Location generating this scene 8 

Date of Generating this Scene YYMMDDHHMMSS 12 

Date of Last Processing this Scene 

YYMMDDHHMMSS 12 


LOCATION 


_ioc 


DATA LOCATION INFORMATION 

Code for Expanded (with EOF) or 

Contracted (w/o EOF) 1 

Number of Logical Scenes for this SO (NLS) 1 

Line Packing Factor - Number of lines per 
logical Record (P) 2 

Record Length, this record (0360 for SD) 4 

Number of SO records (normally * 1) (SD) 4 

Record length of SPI, Bytes (NSI) 8 

Number of SPI Records (SPI 4 

Record Length of SP2, Bytes (NS2) 8 

Number of SP2 Records (SP2) 4 

Record Length of SP3, Bytes (NS3) 8 

Number of SP3 Records (SP3) 4 

Number of lines. Top Border (TB) 4 

Number of lines. Video Data (LV) 8 

Number of lines, Bottom Border (BB) 4 

Number of Bytes, Left Annotation (LA) 4 

Number of Bytes, Right Annotation (RA) 4 

Number of Cells, Left Border (LB) 4 

Number of Cells, Right Border (RB) 4 

Number of Cells, Video Data (CV) 8 

Bytes per Cell, Border Left and Right (NB) 2 

Bytes per Cell, Video (NV) 2 

Total Length of a 

"Line": LL=LA + NB* (LB + RB) + NV*CV + RA 8 

A Packed Logical Record contains P*LL + 6 bytes (PLR) 

LENGTH 

DATA TYPE INFORMATION BYTES 

Date Type: Cellular ( ), Logical ( ), 

Alphameric ( ) 1 

For Cellular Data: Real ( ), Integer ( ), 

Complex ( ), Non-complex ( ) 1 


For Integer, non-complex, cellular: 

Number of significant bits 

Left ( ) or Right ( ) Justification 

For Real: Number of bytes, mantissa 

Number of bytes, exponent 

For Complex: Number of bytes, real part 

Number of bytes. Imaginary part, etc. 2 


LOCATION 
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MULTI-CHANNEL INFO FOR VIDEO CELL INTERLEAVE 

Number of Channels Interleaved 2 

Interleaving Code: Line Interleaved ( ), 

BIPN Cell int N at a time ( N)« 

Band Sequential BSQ { ) 2 

Reserved for future format control -(FBU-1) 

Permanently Uncontrolled - for Local Use. 

(LANDSAT may define some of these - 

other users look out'.) FBU-End 


FOOD FOR THOUGHT ON MAKING 
LANDSAT DATA OF MORE USE 

by F.C. Billingsley 

With the advent of digital corrections being used for LANDSAT C and the 
changes in parameters for LANDSAT D, the present definitions and concepts of stan- 
dard scenes will be changed. In addition, the large size, in pixels, of each scene 
causes problems in CCT formats and film recorders. 

Discussed below are a number of ideas which may be considered 1r order to 
make the data more usable to the digital user. It is realized that someare contro- 
versial; they are presented to provide stimulation and discussion. 

GEOGRAPHICAL PIXEL LAYOUT 

Almost all of the discussions to date have assumed that, of course, the images 
will be projected to some sort of map form. There is certainly a great utility in 
data for this form. But with the increasing usefulness of digital data analysis, the 
finesse involved in the digital analysis becomes the controlling factor. 

Defining such sensor records in terms of <j> and x has nothing to do with map 
projections as long as the values are simply expressed in digital form. Therefore, 
we must answer the two questions: (1) do we really intend to facilitate digital anal- 

ysis and digital data base symptoms? (2) if so, which users, digital or mappers, 
should have to do any subsequent remapping? 

It is proposed here that because of the digital data processing finesse re- 
quired and possible, either raw data or first-generation interpolated data be sup- 
plied to the digital user and that his format be given preference. 

The considerations for the digital user are to minimize all of the reasonably 
local mosaicing problems both along track and across track and to allow call-up of a 
given area with very simple extraction mathematics. The format should be one adapt- 


- 127 - 


able te spacecraft at various altitudes, spacecraft such as SEASAT and HCMM, which 
have widely differing orbits, and to two-dimensional data derived from other sources. 

Because of the large number of map projections and scales in use, no one map 
projection will be universally acceptable. Thus, for map making, there Is an impli- 
cit acceptance of the need for secondary projection. Does secondary interpolation 
hurt? IBM Is doing a study for GSFC to Investigate secondary interpolations and have 
come to the preliminary conclusion that a second Interpolation makes only about 1 per- 
cent difference in multlspectral classification (well within the classification ac- 
curacy noise) and only 1 to 2 digital number difference at edges. Thus, even for di- 
gital user, secondary Interpolation produces only a minor effect. For the map maker, 
it should not be visible at all. Therefore, It Is entirely rational to suggest that 
the map maker accept a set of digital data which has been optimized for the digital 
user and do whatever projection he needs from that. 

But what ground layout should be used? One school of thought holds that for 
the digital user the optimum form of data Is latitude/longitude— that is, the pixel 
lines should be aligned along latitude lines with pixel spacing equal along the earth 
surface. An alternate to this would be east-west pixel lines, with pixels spaced 
equally in longitude. Since lat/long Is still the most universal transfer reference 
format, this may turn out to be the most versatile for all users, as the projection 
equations from this to all of the map projections are readily available. 

Then, If map makers want some projection such as SOM, HOM, polyconic, or what- 
ever, they could reinterpolate from the first order Interpolated data provided to 
digital users. The additional interpolation will be invisible to them. This reinter- 
polatton will also be simplified if the initial Interpolation is into east-west pixel 
lines; from these it is easy to produce maps whose edges are east-west-north-south, 
thus corresponding to the available maps amyway. 

PIXEL SIZE 

For digital processing, pixel size determines that amount of data to be handled. 
However, most users will eventually make pictures from the data. With Incoming pixels 
in the 30 m IFOV range, operating with pixels larger than this will reduce the data 
resolution for both the analysis and film recording. Also, very few film recorders 
have other than "round number" pixel spacing (typically 25, 50, 75, 100 ) available. 

Producing maps at standard scales is desirable, although map projections will still be 
a problem; keeping pixels small will minimize further data loss in the required inter- 
polations. 


CALIBRATION 


With the Inherent slow change of Instrument parameters and the much better at- 
titude control of the spacecraft on LANOSAT D, and even better opportunity than now 
will exist for precise calibrations over entire swaths. The ability to use long term 
averages should minimize the intersensor striping caused by calibration noise and 
should produce a better model of spacecraft motion, allowing better geometric correc- 
tions. Thus, consideration should be given to calibrating an entire swath before 
breaking Into scenes. This should appreciably simplify the calibration procedures, 
which now start over each frame, and produce more frame-to- frame consistency. Then, 

If this Is done, there Is not reason to break the bulk data Into frames for arch vlng, 
thus further simplifying the NASA processing. Some framing capability will still be 
needed for local film production and quality control. 

ARCHIVE RETRIEVAL 

If full geometric and rediometric calibration can be done by swath, the archive 
need only to locate a segment of data for reproduction and dissemination. This allows 
much more freedom In choosing the segment: e.g., to output a specific county or tract, 
on request, as well as standard frames. In the coming days of digital data bases and 
digital operations, the best data compression for the users will be data avoidance-- 
the archive can be a tremendous help in this by sending the users only the requested 
areas, which may be less than the standard frames, or may cross standard frame bound- 
aries. Thus, the swath storage. Eventual switch to video disc will speed the retri- 
eval, as access will be much more rapid than running down an HDT to find a segment. 

Continued conversations with various users strengthens the belief that most 
Jata requests within a digital data system will continue to be by 1 at/long of UTM co- 
ordinates. The archive can expect to receive requests by blocks bounded by geograph- 
ical coordinates, and the users at their consoles will address the data in the same 
way. Therefore, digital data for use in such systems should be in such a format as 
to enable easy data manipulation in these desired reference system. This, also, is 
facilitated by swath storage. There is enough overlap in the orbit tracks to allow 
the extraction of continguous sets of square frames aligned with lat/long or UTM — the 
archive should consider these as possible standard products. 
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